Commit Graph

1147 Commits (135a48d02454ed987223727e6bca5cb8c1223caa)

Author SHA1 Message Date
Alexander Soare b11d949a06 wip checkpoint with some feature extraction work
4 years ago
Alexander Soare 23bb72ce5e nested_transformer wip
4 years ago
Ross Wightman 7919053425 Merge pull request #729 from bryant1410/patch-3
4 years ago
Santiago Castro 49b38a51e3 Add color highlighting to BibTeX entry in README
4 years ago
Ross Wightman 7096b52a61 Remove sotabench files, no longer working / maintained
4 years ago
Ross Wightman d10b071a28 Update results csvs w/ latest ViT, ResMLP, and NfNet-L2 weights present
4 years ago
Ross Wightman 766b4d3262 Fix features for resnetv2_50t
4 years ago
Ross Wightman e8045e712f Fix BatchNorm for ResNetV2 non GN models, add more ResNetV2 model defs for future experimentation, fix zero_init of last residual for pre-act.
4 years ago
Ross Wightman 02aaa785b9 Update README.md
4 years ago
Ross Wightman 7606bdf9e8 Merge pull request #714 from rwightman/vit_and_bit_test_fixes
4 years ago
Ross Wightman 20a2be14c3 Add gMLP-S weights, 79.6 top-1
4 years ago
Ross Wightman 85f894e03d Fix ViT in21k representation (pre_logits) layer handling across old and new npz checkpoints
4 years ago
Ross Wightman b41cffaa93 Fix a few issues loading pretrained vit/bit npz weights w/ num_classes=0 __init__ arg. Missed a few other small classifier handling detail on Mlp, GhostNet, Levit. Should fix #713
4 years ago
Ross Wightman dc422820ec Update README.md
4 years ago
Ross Wightman 79927baaec Merge pull request #702 from rwightman/cleanup_xla_model_fixes
4 years ago
Ross Wightman 9c9755a808 AugReg release
4 years ago
Ross Wightman 381b279785 Add hybrid model fwds back
4 years ago
Ross Wightman 26f04a8e3e Fix a weight link
4 years ago
Ross Wightman 8f4a0222ed Add GMixer-24 MLP model weights, trained w/ TPU + PyTorch XLA
4 years ago
Ross Wightman 4c09a2f169 Bump version 0.4.12
4 years ago
Ross Wightman b319eb5b5d Update ViT weights, more details to be added before merge.
4 years ago
Ross Wightman 8257b86550 Fix up resnetv2 bit/bitm model default res
4 years ago
Ross Wightman 1228f5a3d8 Add BiT distilled 50x1 and teacher 152x2 models from 'A good teacher is patient and consistent' paper.
4 years ago
Ross Wightman 511a8e8c96 Add official ResMLP weights.
4 years ago
Ross Wightman b9cfb64412 Support npz custom load for vision transformer hybrid models. Add posembed rescale for npz load.
4 years ago
Ross Wightman 8319e0c373 Add file docstring to std_conv.py
4 years ago
Ross Wightman 0020268d9b Try lower max size for non_std default_cfg test
4 years ago
Ross Wightman 4d96165989 Merge branch 'master' into cleanup_xla_model_fixes
4 years ago
Ross Wightman 8880f696b6 Refactoring, cleanup, improved test coverage.
4 years ago
Ross Wightman ba2ca4b464 One codepath for stdconv, switch layernorm to batchnorm so gain included. Tweak epsilon values for nfnet, resnetv2, vit hybrid.
4 years ago
Ross Wightman 07fb05cc3d Update results csv files
4 years ago
Ross Wightman b79dfd4fc2 Merge pull request #693 from SamuelGabriel/patch-1
4 years ago
SamuelGabriel 7c19c35d9f Global instead of local rank.
4 years ago
Ross Wightman b7a568f065 Fix torchscript issue in bat
4 years ago
Ross Wightman d17b374f0f Minimum input_size needed to be higher
4 years ago
Ross Wightman b3b90d944d Add min_input_size to bat_resnext to prevent test breakage.
4 years ago
Ross Wightman 758c4438a7 Update README.md
4 years ago
Ross Wightman d413eef1bf Add ResMLP-24 model weights that I trained in PyTorch XLA on TPU-VM. 79.2 top-1.
4 years ago
Ross Wightman 10d8fa4620 Add gc and bat attention resnext26ts variants to byob for test.
4 years ago
Ross Wightman 2f5ed2dec1 Update `init_values` const for 24 and 36 layer ResMLP models
4 years ago
Ross Wightman 8e4ac3549f All ScaledStdConv and StdConv uses default to using F.layernorm so that they work with PyTorch XLA. eps value tweaking is a WIP.
4 years ago
Ross Wightman 2a63d0246b Post merge cleanup
4 years ago
Ross Wightman 45dec179e5 Merge pull request #681 from lmk123568/master
4 years ago
Ross Wightman 4907f8f70d Merge pull request #685 from dyhan0920/master
4 years ago
Dongyoon Han ded1671483 Fix stochastic depth working only with a shortcut
4 years ago
Mike b87d98b238 Update convit.py
4 years ago
Ross Wightman 54a6cca27a Merge pull request #668 from rwightman/more_attn
4 years ago
Ross Wightman 02320c3e3d Bump version to 0.4.11
4 years ago
Ross Wightman bda8ab015a Remove min channels for SelectiveKernel, divisor should cover cases well enough.
4 years ago
Ross Wightman a27f4aec4a Missed args for skresnext w/ refactoring.
4 years ago