Commit Graph

954 Commits (ba2ca4b46440c9fcf579fc66ca6df3082db44475)
 

Author SHA1 Message Date
Ross Wightman ba2ca4b464 One codepath for stdconv, switch layernorm to batchnorm so gain included. Tweak epsilon values for nfnet, resnetv2, vit hybrid.
3 years ago
Ross Wightman 2f5ed2dec1 Update `init_values` const for 24 and 36 layer ResMLP models
3 years ago
Ross Wightman 8e4ac3549f All ScaledStdConv and StdConv uses default to using F.layernorm so that they work with PyTorch XLA. eps value tweaking is a WIP.
3 years ago
Ross Wightman 54a6cca27a Merge pull request #668 from rwightman/more_attn
3 years ago
Ross Wightman 02320c3e3d Bump version to 0.4.11
3 years ago
Ross Wightman bda8ab015a Remove min channels for SelectiveKernel, divisor should cover cases well enough.
3 years ago
Ross Wightman a27f4aec4a Missed args for skresnext w/ refactoring.
3 years ago
Ross Wightman 307a935b79 Add non-local and BAT attention. Merge attn and self-attn factories into one. Add attention references to README. Add mlp 'mode' to ECA.
3 years ago
Ross Wightman 17dc47c8e6 Missed comma in test filters.
3 years ago
Ross Wightman 34522097b1 See if we can use tcmalloc in test runner
3 years ago
Ross Wightman 8bf63b6c6c Able to use other attn layer in EfficientNet now. Create test ECA + GC B0 configs. Make ECA more configurable.
3 years ago
Ross Wightman bcec14d3b5 Bring EfficientNet SE layer in line with others, pull se_ratio outside of blocks. Allows swapping w/ other attn layers.
3 years ago
Ross Wightman 9611458e19 Throw in some FBNetV3 code I had lying around, some refactoring of SE reduction channel calcs for all EffNet archs.
3 years ago
Ross Wightman 01b9108619 Merge branch 'master' into more_attn
3 years ago
Ross Wightman d7bab8a6c5 Fix strict flag change for checkpoint load.
3 years ago
Ross Wightman 02f9d4bc34 Add weights for resnet51q model, add 61q def.
3 years ago
Ross Wightman f615474be3 Fix broken test, repvgg block doesn't have attn_last attr.
3 years ago
Ross Wightman 742c2d5247 Add Gather-Excite and Global Context attn modules. Refactor existing SE-like attn for consistency and refactor byob/byoanet for less redundancy.
3 years ago
Ross Wightman 9c78de8c02 Fix #661, move hardswish out of default args for LeViT. Enable native torch support for hardswish, hardsigmoid, mish if present.
3 years ago
Ross Wightman 07d952c7a7 Merge pull request #637 from rwightman/levit_visformer_rednet
3 years ago
Ross Wightman 7f368782b7 Merge pull request #660 from petervandenabeele/readme_fix_typos
3 years ago
Ross Wightman 5db7452173 Fix visformer in_chans stem handling
3 years ago
Peter Vandenabeele 51c432150a README: fix simple typos
3 years ago
Ross Wightman fd92ba0de8 Filter large vit models from torchscript tests
3 years ago
Ross Wightman 318360c3f9 Update README.md before merge. Bump version to 0.4.10
3 years ago
Ross Wightman 99d97e0d67 Hopefully the last test update for this PR...
3 years ago
Ross Wightman 11ae795e99 Redo LeViT attention bias caching in a way that works with both torchscript and DataParallel
3 years ago
Ross Wightman d400f1dbdd Filter test models before creation for backward/torchscript tests
3 years ago
Ross Wightman c4572cc5aa Add Visformer-small weights, tweak torchscript jit test img size.
4 years ago
Ross Wightman 83487e2a0d Lower max backward size for tests.
4 years ago
Ross Wightman bfc72f75d3 Expand scope of testing for non-std vision transformer / mlp models. Some related cleanup and create fn cleanup for all vision transformer and mlp models. More CoaT weights.
4 years ago
Ross Wightman 18bf520ad1 Add eca_nfnet_l2/l3 defs for future training
4 years ago
Ross Wightman f45de37690 Merge branch 'master' into levit_visformer_rednet
4 years ago
Ross Wightman 23c18a33e4 Add efficientnetv2_rw_m weights trained in PyTorch. 84.8 top-1 @ 416 test. 53M params.
4 years ago
Ross Wightman c2ba229d99 Prep for efficientnetv2_rw_m model weights that started training before official release.
4 years ago
Ross Wightman 22f7c6760f Merge branch 'alexander-soare-master'
4 years ago
Ross Wightman 30b9880d06 Minor adjustment, mutable default arg, extra check of valid len...
4 years ago
Ross Wightman be0abfbcce Merge branch 'master' of https://github.com/alexander-soare/pytorch-image-models into alexander-soare-master
4 years ago
Ross Wightman d4c00d6150 Merge branch 'amaarora-convit'
4 years ago
Ross Wightman b7de82e835 ConViT cleanup, fix torchscript, bit of reformatting, reuse existing layers.
4 years ago
Ross Wightman 306c86b668 Merge branch 'convit' of https://github.com/amaarora/pytorch-image-models into amaarora-convit
4 years ago
Ross Wightman 6e04da0c54 Merge branch 'abcdvzz-master'
4 years ago
Ross Wightman a569635045 Update twin weights to a copy in GitHub releases for faster dl. Tweak model class comment.
4 years ago
Ross Wightman be99eef9c1 Remove redundant code, cleanup, fix torchscript.
4 years ago
Ross Wightman 5ab372a3ec Merge branch 'master' of https://github.com/abcdvzz/pytorch-image-models into abcdvzz-master
4 years ago
Aman Arora 50d6aab0ef Add convit to non-std filters as vit_
4 years ago
Aman Arora 5db1eb6ba5 Add defaults
4 years ago
Aman Arora 9b97753f9c Merge branch 'convit' of https://github.com/amaarora/pytorch-image-models into convit
4 years ago
Aman Arora 1633317489 update tests and exclude convit_base
4 years ago
Aman Arora 8b1f2e8e1f remove unused matplotlib import
4 years ago