Ross Wightman
|
b57a03bd0d
|
Merge branch 'master' into bits_and_tpu
|
4 years ago |
Ross Wightman
|
91ab0b6ce5
|
Add proper TrainState checkpoint save/load. Some reorg/refactoring and other cleanup. More to go...
|
4 years ago |
Ross Wightman
|
02320c3e3d
|
Bump version to 0.4.11
|
4 years ago |
Ross Wightman
|
bda8ab015a
|
Remove min channels for SelectiveKernel, divisor should cover cases well enough.
|
4 years ago |
Ross Wightman
|
a27f4aec4a
|
Missed args for skresnext w/ refactoring.
|
4 years ago |
Ross Wightman
|
307a935b79
|
Add non-local and BAT attention. Merge attn and self-attn factories into one. Add attention references to README. Add mlp 'mode' to ECA.
|
4 years ago |
Ross Wightman
|
8bf63b6c6c
|
Able to use other attn layer in EfficientNet now. Create test ECA + GC B0 configs. Make ECA more configurable.
|
4 years ago |
Ross Wightman
|
bcec14d3b5
|
Bring EfficientNet SE layer in line with others, pull se_ratio outside of blocks. Allows swapping w/ other attn layers.
|
4 years ago |
Ross Wightman
|
9611458e19
|
Throw in some FBNetV3 code I had lying around, some refactoring of SE reduction channel calcs for all EffNet archs.
|
4 years ago |
Ross Wightman
|
01b9108619
|
Merge branch 'master' into more_attn
|
4 years ago |
Ross Wightman
|
d7bab8a6c5
|
Fix strict flag change for checkpoint load.
|
4 years ago |
Ross Wightman
|
02f9d4bc34
|
Add weights for resnet51q model, add 61q def.
|
4 years ago |
Ross Wightman
|
f615474be3
|
Fix broken test, repvgg block doesn't have attn_last attr.
|
4 years ago |
Ross Wightman
|
742c2d5247
|
Add Gather-Excite and Global Context attn modules. Refactor existing SE-like attn for consistency and refactor byob/byoanet for less redundancy.
|
4 years ago |
Ross Wightman
|
9c78de8c02
|
Fix #661, move hardswish out of default args for LeViT. Enable native torch support for hardswish, hardsigmoid, mish if present.
|
4 years ago |
Ross Wightman
|
5db7452173
|
Fix visformer in_chans stem handling
|
4 years ago |
Ross Wightman
|
318360c3f9
|
Update README.md before merge. Bump version to 0.4.10
|
4 years ago |
Ross Wightman
|
11ae795e99
|
Redo LeViT attention bias caching in a way that works with both torchscript and DataParallel
|
4 years ago |
Ross Wightman
|
d400f1dbdd
|
Filter test models before creation for backward/torchscript tests
|
4 years ago |
Ross Wightman
|
c4572cc5aa
|
Add Visformer-small weighs, tweak torchscript jit test img size.
|
4 years ago |
Ross Wightman
|
bfc72f75d3
|
Expand scope of testing for non-std vision transformer / mlp models. Some related cleanup and create fn cleanup for all vision transformer and mlp models. More CoaT weights.
|
4 years ago |
Ross Wightman
|
18bf520ad1
|
Add eca_nfnet_l2/l3 defs for future training
|
4 years ago |
Ross Wightman
|
f45de37690
|
Merge branch 'master' into levit_visformer_rednet
|
4 years ago |
Ross Wightman
|
23c18a33e4
|
Add efficientnetv2_rw_m weights trained in PyTorch. 84.8 top-1 @ 416 test. 53M params.
|
4 years ago |
Ross Wightman
|
5b9c69e80a
|
Add basic training resume based on legacy code
|
4 years ago |
Ross Wightman
|
c2ba229d99
|
Prep for effcientnetv2_rw_m model weights that started training before official release..
|
4 years ago |
Ross Wightman
|
30b9880d06
|
Minor adjustment, mutable default arg, extra check of valid len...
|
4 years ago |
Ross Wightman
|
be0abfbcce
|
Merge branch 'master' of https://github.com/alexander-soare/pytorch-image-models into alexander-soare-master
|
4 years ago |
Ross Wightman
|
b7de82e835
|
ConViT cleanup, fix torchscript, bit of reformatting, reuse existing layers.
|
4 years ago |
Ross Wightman
|
306c86b668
|
Merge branch 'convit' of https://github.com/amaarora/pytorch-image-models into amaarora-convit
|
4 years ago |
Ross Wightman
|
a569635045
|
Update twin weights to a copy in GitHub releases for faster dl. Tweak model class comment.
|
4 years ago |
Ross Wightman
|
be99eef9c1
|
Remove redundant code, cleanup, fix torchscript.
|
4 years ago |
Ross Wightman
|
5ab372a3ec
|
Merge branch 'master' of https://github.com/abcdvzz/pytorch-image-models into abcdvzz-master
|
4 years ago |
Aman Arora
|
5db1eb6ba5
|
Add defaults
|
4 years ago |
Aman Arora
|
8b1f2e8e1f
|
remote unused matplotlib import
|
4 years ago |
Aman Arora
|
40c506ba1e
|
Add ConViT
|
4 years ago |
Alexander Soare
|
7976019864
|
extend positional embedding resizing functionality to tnt
|
4 years ago |
Alexander Soare
|
8086943b6f
|
allow resize positional embeddings to non-square grid
|
4 years ago |
talrid
|
dc1a4efd28
|
mixer_b16_224_miil, mixer_b16_224_miil_in21k models
|
4 years ago |
Ross Wightman
|
4210d922d2
|
Merge branch 'master' into bits_and_tpu
|
4 years ago |
李鑫杰
|
7b799c4e79
|
add latest code
|
4 years ago |
Ross Wightman
|
72ca831dd4
|
Back to using strings for the enum translation, forgot about import dep
|
4 years ago |
Ross Wightman
|
d5af752117
|
Add preliminary gMLP and ResMLP impl to Mlp-Mixer
|
4 years ago |
Ross Wightman
|
cbd4ee737f
|
Fix model init for XLA, remove some prints.
|
4 years ago |
李鑫杰
|
00548b8427
|
Add Twins
|
4 years ago |
Ross Wightman
|
74d2829341
|
Merge branch 'master' into bits_and_tpu
|
4 years ago |
Ross Wightman
|
aa92d7b1c5
|
Major timm.bits update. Updater and DeviceEnv now dataclasses, after_step closure used, metrics base impl w/ distributed reduce, many tweaks/fixes.
|
4 years ago |
Ross Wightman
|
e7f0db8664
|
Fix drop/drop_path arg on MLP-Mixer model. Fix #641
|
4 years ago |
Ross Wightman
|
9a3ae97311
|
Another set of byoanet models w/ ECA channel + SA + groups
|
4 years ago |
Ross Wightman
|
d53e91218e
|
Fix tf.data options setting for newer TF versions
|
4 years ago |