Alexander Soare
23bb72ce5e
nested_transformer wip
3 years ago
Ross Wightman
7919053425
Merge pull request #729 from bryant1410/patch-3
...
Add color highlighting to BibTeX entry in README
3 years ago
Santiago Castro
49b38a51e3
Add color highlighting to BibTeX entry in README
3 years ago
Ross Wightman
7096b52a61
Remove sotabench files, no longer working / maintained
3 years ago
Ross Wightman
d10b071a28
Update results csvs w/ latest ViT, ResMLP, and NfNet-L2 weights present
3 years ago
Ross Wightman
766b4d3262
Fix features for resnetv2_50t
3 years ago
Ross Wightman
e8045e712f
Fix BatchNorm for ResNetV2 non GN models, add more ResNetV2 model defs for future experimentation, fix zero_init of last residual for pre-act.
3 years ago
Ross Wightman
02aaa785b9
Update README.md
3 years ago
Ross Wightman
7606bdf9e8
Merge pull request #714 from rwightman/vit_and_bit_test_fixes
...
Fix a few issues loading pretrained vit/bit npz weights...
3 years ago
Ross Wightman
20a2be14c3
Add gMLP-S weights, 79.6 top-1
3 years ago
Ross Wightman
85f894e03d
Fix ViT in21k representation (pre_logits) layer handling across old and new npz checkpoints
3 years ago
Ross Wightman
b41cffaa93
Fix a few issues loading pretrained vit/bit npz weights w/ num_classes=0 __init__ arg. Missed a few other small classifier handling detail on Mlp, GhostNet, Levit. Should fix #713
3 years ago
Ross Wightman
dc422820ec
Update README.md
3 years ago
Ross Wightman
79927baaec
Merge pull request #702 from rwightman/cleanup_xla_model_fixes
...
AugReg Vision Transformers, XLA model compat for ResNetV2-BiT / NFNet, ECA-NFNet-L2, GMixer-24 weights, ResMLP official weights, and cleanup
3 years ago
Ross Wightman
9c9755a808
AugReg release
3 years ago
Ross Wightman
381b279785
Add hybrid model fwds back
3 years ago
Ross Wightman
26f04a8e3e
Fix a weight link
3 years ago
Ross Wightman
8f4a0222ed
Add GMixer-24 MLP model weights, trained w/ TPU + PyTorch XLA
3 years ago
Ross Wightman
4c09a2f169
Bump version 0.4.12
3 years ago
Ross Wightman
b319eb5b5d
Update ViT weights, more details to be added before merge.
3 years ago
Ross Wightman
8257b86550
Fix up resnetv2 bit/bitm model default res
3 years ago
Ross Wightman
1228f5a3d8
Add BiT distilled 50x1 and teacher 152x2 models from 'A good teacher is patient and consistent' paper.
3 years ago
Ross Wightman
511a8e8c96
Add official ResMLP weights.
3 years ago
Ross Wightman
b9cfb64412
Support npz custom load for vision transformer hybrid models. Add posembed rescale for npz load.
3 years ago
Ross Wightman
8319e0c373
Add file docstring to std_conv.py
3 years ago
Ross Wightman
0020268d9b
Try lower max size for non_std default_cfg test
3 years ago
Ross Wightman
4d96165989
Merge branch 'master' into cleanup_xla_model_fixes
3 years ago
Ross Wightman
8880f696b6
Refactoring, cleanup, improved test coverage.
...
* Add eca_nfnet_l2 weights, 84.7 @ 384x384
* All 'non-std' (ie transformer / mlp) models have classifier / default_cfg test added
* Fix #694 reset_classifer / num_features / forward_features / num_classes=0 consistency for transformer / mlp models
* Add direct loading of npz to vision transformer (pure transformer so far, hybrid to come)
* Rename vit_deit* to deit_*
* Remove some deprecated vit hybrid model defs
* Clean up classifier flatten for conv classifiers and unusual cases (mobilenetv3/ghostnet)
* Remove explicit model fns for levit conv, just pass in arg
3 years ago
Ross Wightman
ba2ca4b464
One codepath for stdconv, switch layernorm to batchnorm so gain included. Tweak epsilon values for nfnet, resnetv2, vit hybrid.
3 years ago
Ross Wightman
07fb05cc3d
Update results csv files
3 years ago
Ross Wightman
b79dfd4fc2
Merge pull request #693 from SamuelGabriel/patch-1
...
Let only the _globally_ 0th rank write checkpoints in `train.py`
3 years ago
SamuelGabriel
7c19c35d9f
Global instead of local rank.
3 years ago
Ross Wightman
b7a568f065
Fix torchscript issue in bat
3 years ago
Ross Wightman
d17b374f0f
Minimum input_size needed to be higher
3 years ago
Ross Wightman
b3b90d944d
Add min_input_size to bat_resnext to prevent test breakage.
3 years ago
Ross Wightman
758c4438a7
Update README.md
3 years ago
Ross Wightman
d413eef1bf
Add ResMLP-24 model weights that I trained in PyTorch XLA on TPU-VM. 79.2 top-1.
3 years ago
Ross Wightman
10d8fa4620
Add gc and bat attention resnext26ts variants to byob for test.
3 years ago
Ross Wightman
2f5ed2dec1
Update `init_values` const for 24 and 36 layer ResMLP models
3 years ago
Ross Wightman
8e4ac3549f
All ScaledStdConv and StdConv uses default to using F.layernorm so that they work with PyTorch XLA. eps value tweaking is a WIP.
3 years ago
Ross Wightman
2a63d0246b
Post merge cleanup
3 years ago
Ross Wightman
45dec179e5
Merge pull request #681 from lmk123568/master
...
Update convit.py
3 years ago
Ross Wightman
4907f8f70d
Merge pull request #685 from dyhan0920/master
...
Update rexnet.py
3 years ago
Dongyoon Han
ded1671483
Fix stochastic depth working only with a shortcut
3 years ago
Mike
b87d98b238
Update convit.py
...
Cut out the duplicates
3 years ago
Ross Wightman
54a6cca27a
Merge pull request #668 from rwightman/more_attn
...
Add Gather-Excite, Global Context, BAT, Non-Local attn modules and refactored all attn modules and factory for improved consistency. EfficientNet / MobileNetV3 backbones able to use a wider variety of attention modules.
4 years ago
Ross Wightman
02320c3e3d
Bump version to 0.4.11
4 years ago
Ross Wightman
bda8ab015a
Remove min channels for SelectiveKernel, divisor should cover cases well enough.
4 years ago
Ross Wightman
a27f4aec4a
Missed args for skresnext w/ refactoring.
4 years ago
Ross Wightman
307a935b79
Add non-local and BAT attention. Merge attn and self-attn factories into one. Add attention references to README. Add mlp 'mode' to ECA.
4 years ago