Commit Graph

943 Commits (bcec14d3b585d7b5c469705f99fb2d830bdcdb7d)
 

Author SHA1 Message Date
Ross Wightman bcec14d3b5 Bring EfficientNet SE layer in line with others, pull se_ratio outside of blocks. Allows swapping w/ other attn layers.
3 years ago
Ross Wightman 9611458e19 Throw in some FBNetV3 code I had lying around, some refactoring of SE reduction channel calcs for all EffNet archs.
3 years ago
Ross Wightman 01b9108619 Merge branch 'master' into more_attn
3 years ago
Ross Wightman d7bab8a6c5 Fix strict flag change for checkpoint load.
3 years ago
Ross Wightman 02f9d4bc34 Add weights for resnet51q model, add 61q def.
3 years ago
Ross Wightman f615474be3 Fix broken test, repvgg block doesn't have attn_last attr.
3 years ago
Ross Wightman 742c2d5247 Add Gather-Excite and Global Context attn modules. Refactor existing SE-like attn for consistency and refactor byob/byoanet for less redundancy.
3 years ago
Ross Wightman 9c78de8c02 Fix #661, move hardswish out of default args for LeViT. Enable native torch support for hardswish, hardsigmoid, mish if present.
3 years ago
Ross Wightman 07d952c7a7
Merge pull request #637 from rwightman/levit_visformer_rednet
3 years ago
Ross Wightman 7f368782b7
Merge pull request #660 from petervandenabeele/readme_fix_typos
3 years ago
Ross Wightman 5db7452173 Fix visformer in_chans stem handling
3 years ago
Peter Vandenabeele 51c432150a README: fix simple typos
3 years ago
Ross Wightman fd92ba0de8 Filter large vit models from torchscript tests
3 years ago
Ross Wightman 318360c3f9 Update README.md before merge. Bump version to 0.4.10
3 years ago
Ross Wightman 99d97e0d67 Hopefully the last test update for this PR...
3 years ago
Ross Wightman 11ae795e99 Redo LeViT attention bias caching in a way that works with both torchscript and DataParallel
3 years ago
Ross Wightman d400f1dbdd Filter test models before creation for backward/torchscript tests
3 years ago
Ross Wightman c4572cc5aa Add Visformer-small weighs, tweak torchscript jit test img size.
3 years ago
Ross Wightman 83487e2a0d Lower max backward size for tests.
3 years ago
Ross Wightman bfc72f75d3 Expand scope of testing for non-std vision transformer / mlp models. Some related cleanup and create fn cleanup for all vision transformer and mlp models. More CoaT weights.
3 years ago
Ross Wightman 18bf520ad1 Add eca_nfnet_l2/l3 defs for future training
3 years ago
Ross Wightman f45de37690 Merge branch 'master' into levit_visformer_rednet
3 years ago
Ross Wightman 23c18a33e4 Add efficientnetv2_rw_m weights trained in PyTorch. 84.8 top-1 @ 416 test. 53M params.
3 years ago
Ross Wightman c2ba229d99 Prep for effcientnetv2_rw_m model weights that started training before official release..
3 years ago
Ross Wightman 22f7c6760f Merge branch 'alexander-soare-master'
3 years ago
Ross Wightman 30b9880d06 Minor adjustment, mutable default arg, extra check of valid len...
3 years ago
Ross Wightman be0abfbcce Merge branch 'master' of https://github.com/alexander-soare/pytorch-image-models into alexander-soare-master
3 years ago
Ross Wightman d4c00d6150 Merge branch 'amaarora-convit'
3 years ago
Ross Wightman b7de82e835 ConViT cleanup, fix torchscript, bit of reformatting, reuse existing layers.
3 years ago
Ross Wightman 306c86b668 Merge branch 'convit' of https://github.com/amaarora/pytorch-image-models into amaarora-convit
3 years ago
Ross Wightman 6e04da0c54 Merge branch 'abcdvzz-master'
3 years ago
Ross Wightman a569635045 Update twin weights to a copy in GitHub releases for faster dl. Tweak model class comment.
3 years ago
Ross Wightman be99eef9c1 Remove redundant code, cleanup, fix torchscript.
3 years ago
Ross Wightman 5ab372a3ec Merge branch 'master' of https://github.com/abcdvzz/pytorch-image-models into abcdvzz-master
3 years ago
Aman Arora 50d6aab0ef Add convit to non-std filters as vit_
3 years ago
Aman Arora 5db1eb6ba5 Add defaults
3 years ago
Aman Arora 9b97753f9c Merge branch 'convit' of https://github.com/amaarora/pytorch-image-models into convit
3 years ago
Aman Arora 1633317489 update tests and exclude convit_base
3 years ago
Aman Arora 8b1f2e8e1f remote unused matplotlib import
3 years ago
Aman Arora 40c506ba1e Add ConViT
3 years ago
Ross Wightman b4ebf9263e
Merge pull request #651 from mrT23/master
3 years ago
Alexander Soare 7976019864 extend positional embedding resizing functionality to tnt
3 years ago
Alexander Soare 8086943b6f allow resize positional embeddings to non-square grid
3 years ago
talrid dc1a4efd28 mixer_b16_224_miil, mixer_b16_224_miil_in21k models
3 years ago
mrT23 709d7c07e8 Merge branch 'rwightman:master' into master
3 years ago
talrid 240e667746 Revert "mixer_b16_224_miil"
3 years ago
李鑫杰 d046498e0b update test_models.py
3 years ago
李鑫杰 7b799c4e79 add latest code
3 years ago
Ross Wightman cf4ce2f15b
Merge pull request #647 from rwightman/more_mlp
3 years ago
Ross Wightman b3b503c13d
Merge pull request #626 from cvcomar/master
3 years ago