Commit Graph

555 Commits (6b2d9c2660cde4fce1ef9b86ae1806c07b8368e7)

Author SHA1 Message Date
Ross Wightman 6b2d9c2660 Another bits/README.md update
3 years ago
Ross Wightman c3db5f5801 Worker hack for TFDS eval, add TPU env var setting.
3 years ago
Ross Wightman f411724de4 Fix checkpoint delete issue. Add README about bits and initial Pytorch XLA usage on TPU-VM. Add some FIXMEs and fold train_cfg into train_state by default.
3 years ago
Ross Wightman b57a03bd0d Merge branch 'master' into bits_and_tpu
4 years ago
Ross Wightman 91ab0b6ce5 Add proper TrainState checkpoint save/load. Some reorg/refactoring and other cleanup. More to go...
4 years ago
Ross Wightman 02320c3e3d Bump version to 0.4.11
4 years ago
Ross Wightman bda8ab015a Remove min channels for SelectiveKernel, divisor should cover cases well enough.
4 years ago
Ross Wightman a27f4aec4a Missed args for skresnext w/ refactoring.
4 years ago
Ross Wightman 307a935b79 Add non-local and BAT attention. Merge attn and self-attn factories into one. Add attention references to README. Add mlp 'mode' to ECA.
4 years ago
Ross Wightman 8bf63b6c6c Able to use other attn layer in EfficientNet now. Create test ECA + GC B0 configs. Make ECA more configurable.
4 years ago
Ross Wightman bcec14d3b5 Bring EfficientNet SE layer in line with others, pull se_ratio outside of blocks. Allows swapping w/ other attn layers.
4 years ago
Ross Wightman 9611458e19 Throw in some FBNetV3 code I had lying around, some refactoring of SE reduction channel calcs for all EffNet archs.
4 years ago
Ross Wightman 01b9108619 Merge branch 'master' into more_attn
4 years ago
Ross Wightman d7bab8a6c5 Fix strict flag change for checkpoint load.
4 years ago
Ross Wightman 02f9d4bc34 Add weights for resnet51q model, add 61q def.
4 years ago
Ross Wightman f615474be3 Fix broken test, repvgg block doesn't have attn_last attr.
4 years ago
Ross Wightman 742c2d5247 Add Gather-Excite and Global Context attn modules. Refactor existing SE-like attn for consistency and refactor byob/byoanet for less redundancy.
4 years ago
Ross Wightman 9c78de8c02 Fix #661, move hardswish out of default args for LeViT. Enable native torch support for hardswish, hardsigmoid, mish if present.
4 years ago
Ross Wightman 5db7452173 Fix visformer in_chans stem handling
4 years ago
Ross Wightman 318360c3f9 Update README.md before merge. Bump version to 0.4.10
4 years ago
Ross Wightman 11ae795e99 Redo LeViT attention bias caching in a way that works with both torchscript and DataParallel
4 years ago
Ross Wightman d400f1dbdd Filter test models before creation for backward/torchscript tests
4 years ago
Ross Wightman c4572cc5aa Add Visformer-small weighs, tweak torchscript jit test img size.
4 years ago
Ross Wightman bfc72f75d3 Expand scope of testing for non-std vision transformer / mlp models. Some related cleanup and create fn cleanup for all vision transformer and mlp models. More CoaT weights.
4 years ago
Ross Wightman 18bf520ad1 Add eca_nfnet_l2/l3 defs for future training
4 years ago
Ross Wightman f45de37690 Merge branch 'master' into levit_visformer_rednet
4 years ago
Ross Wightman 23c18a33e4 Add efficientnetv2_rw_m weights trained in PyTorch. 84.8 top-1 @ 416 test. 53M params.
4 years ago
Ross Wightman 5b9c69e80a Add basic training resume based on legacy code
4 years ago
Ross Wightman c2ba229d99 Prep for effcientnetv2_rw_m model weights that started training before official release..
4 years ago
Ross Wightman 30b9880d06 Minor adjustment, mutable default arg, extra check of valid len...
4 years ago
Ross Wightman be0abfbcce Merge branch 'master' of https://github.com/alexander-soare/pytorch-image-models into alexander-soare-master
4 years ago
Ross Wightman b7de82e835 ConViT cleanup, fix torchscript, bit of reformatting, reuse existing layers.
4 years ago
Ross Wightman 306c86b668 Merge branch 'convit' of https://github.com/amaarora/pytorch-image-models into amaarora-convit
4 years ago
Ross Wightman a569635045 Update twin weights to a copy in GitHub releases for faster dl. Tweak model class comment.
4 years ago
Ross Wightman be99eef9c1 Remove redundant code, cleanup, fix torchscript.
4 years ago
Ross Wightman 5ab372a3ec Merge branch 'master' of https://github.com/abcdvzz/pytorch-image-models into abcdvzz-master
4 years ago
Aman Arora 5db1eb6ba5 Add defaults
4 years ago
Aman Arora 8b1f2e8e1f remote unused matplotlib import
4 years ago
Aman Arora 40c506ba1e Add ConViT
4 years ago
Alexander Soare 7976019864 extend positional embedding resizing functionality to tnt
4 years ago
Alexander Soare 8086943b6f allow resize positional embeddings to non-square grid
4 years ago
talrid dc1a4efd28 mixer_b16_224_miil, mixer_b16_224_miil_in21k models
4 years ago
Ross Wightman 4210d922d2 Merge branch 'master' into bits_and_tpu
4 years ago
李鑫杰 7b799c4e79 add latest code
4 years ago
Ross Wightman 72ca831dd4 Back to using strings for the enum translation, forgot about import dep
4 years ago
Ross Wightman d5af752117 Add preliminary gMLP and ResMLP impl to Mlp-Mixer
4 years ago
Ross Wightman cbd4ee737f Fix model init for XLA, remove some prints.
4 years ago
李鑫杰 00548b8427 Add Twins
4 years ago
Ross Wightman 74d2829341 Merge branch 'master' into bits_and_tpu
4 years ago
Ross Wightman aa92d7b1c5 Major timm.bits update. Updater and DeviceEnv now dataclasses, after_step closure used, metrics base impl w/ distributed reduce, many tweaks/fixes.
4 years ago