Commit Graph

994 Commits (49b38a51e3b97a7c97c8b382f6bd63025e037993)
 

Author SHA1 Message Date
Santiago Castro 49b38a51e3
Add color highlighting to BibTeX entry in README
3 years ago
Ross Wightman 7096b52a61 Remove sotabench files, no longer working / maintained
3 years ago
Ross Wightman d10b071a28 Update results csvs w/ latest ViT, ResMLP, and NfNet-L2 weights present
3 years ago
Ross Wightman 766b4d3262 Fix features for resnetv2_50t
3 years ago
Ross Wightman e8045e712f Fix BatchNorm for ResNetV2 non GN models, add more ResNetV2 model defs for future experimentation, fix zero_init of last residual for pre-act.
3 years ago
Ross Wightman 02aaa785b9
Update README.md
3 years ago
Ross Wightman 7606bdf9e8
Merge pull request #714 from rwightman/vit_and_bit_test_fixes
3 years ago
Ross Wightman 20a2be14c3 Add gMLP-S weights, 79.6 top-1
3 years ago
Ross Wightman 85f894e03d Fix ViT in21k representation (pre_logits) layer handling across old and new npz checkpoints
3 years ago
Ross Wightman b41cffaa93 Fix a few issues loading pretrained vit/bit npz weights w/ num_classes=0 __init__ arg. Missed a few other small classifier handling detail on Mlp, GhostNet, Levit. Should fix #713
3 years ago
Ross Wightman dc422820ec
Update README.md
3 years ago
Ross Wightman 79927baaec
Merge pull request #702 from rwightman/cleanup_xla_model_fixes
3 years ago
Ross Wightman 9c9755a808 AugReg release
3 years ago
Ross Wightman 381b279785 Add hybrid model fwds back
3 years ago
Ross Wightman 26f04a8e3e Fix a weight link
3 years ago
Ross Wightman 8f4a0222ed Add GMixer-24 MLP model weights, trained w/ TPU + PyTorch XLA
3 years ago
Ross Wightman 4c09a2f169 Bump version 0.4.12
3 years ago
Ross Wightman b319eb5b5d Update ViT weights, more details to be added before merge.
3 years ago
Ross Wightman 8257b86550 Fix up resnetv2 bit/bitm model default res
3 years ago
Ross Wightman 1228f5a3d8 Add BiT distilled 50x1 and teacher 152x2 models from 'A good teacher is patient and consistent' paper.
3 years ago
Ross Wightman 511a8e8c96 Add official ResMLP weights.
3 years ago
Ross Wightman b9cfb64412 Support npz custom load for vision transformer hybrid models. Add posembed rescale for npz load.
3 years ago
Ross Wightman 8319e0c373 Add file docstring to std_conv.py
3 years ago
Ross Wightman 0020268d9b Try lower max size for non_std default_cfg test
3 years ago
Ross Wightman 4d96165989 Merge branch 'master' into cleanup_xla_model_fixes
3 years ago
Ross Wightman 8880f696b6 Refactoring, cleanup, improved test coverage.
3 years ago
Ross Wightman ba2ca4b464 One codepath for stdconv, switch layernorm to batchnorm so gain included. Tweak epsilon values for nfnet, resnetv2, vit hybrid.
3 years ago
Ross Wightman 07fb05cc3d Update results csv files
3 years ago
Ross Wightman b79dfd4fc2
Merge pull request #693 from SamuelGabriel/patch-1
3 years ago
SamuelGabriel 7c19c35d9f
Global instead of local rank.
3 years ago
Ross Wightman b7a568f065 Fix torchscript issue in bat
3 years ago
Ross Wightman d17b374f0f Minimum input_size needed to be higher
3 years ago
Ross Wightman b3b90d944d Add min_input_size to bat_resnext to prevent test breakage.
3 years ago
Ross Wightman 758c4438a7 Update README.md
3 years ago
Ross Wightman d413eef1bf Add ResMLP-24 model weights that I trained in PyTorch XLA on TPU-VM. 79.2 top-1.
3 years ago
Ross Wightman 10d8fa4620 Add gc and bat attention resnext26ts variants to byob for test.
3 years ago
Ross Wightman 2f5ed2dec1 Update `init_values` const for 24 and 36 layer ResMLP models
3 years ago
Ross Wightman 8e4ac3549f All ScaledStdConv and StdConv uses default to using F.layernorm so that they work with PyTorch XLA. eps value tweaking is a WIP.
3 years ago
Ross Wightman 2a63d0246b Post merge cleanup
3 years ago
Ross Wightman 45dec179e5
Merge pull request #681 from lmk123568/master
3 years ago
Ross Wightman 4907f8f70d
Merge pull request #685 from dyhan0920/master
3 years ago
Dongyoon Han ded1671483 Fix stochastic depth working only with a shortcut
3 years ago
Mike b87d98b238
Update convit.py
3 years ago
Ross Wightman 54a6cca27a
Merge pull request #668 from rwightman/more_attn
3 years ago
Ross Wightman 02320c3e3d Bump version to 0.4.11
3 years ago
Ross Wightman bda8ab015a Remove min channels for SelectiveKernel, divisor should cover cases well enough.
3 years ago
Ross Wightman a27f4aec4a Missed args for skresnext w/ refactoring.
3 years ago
Ross Wightman 307a935b79 Add non-local and BAT attention. Merge attn and self-attn factories into one. Add attention references to README. Add mlp 'mode' to ECA.
3 years ago
Ross Wightman 17dc47c8e6 Missed comma in test filters.
3 years ago
Ross Wightman 34522097b1 See if we can use tcmalloc in test runner
3 years ago