Commit Graph

903 Commits (6e559e9b5fb3db657f68727a90adf89603172cfe)

Author SHA1 Message Date
Ross Wightman a426511c95 More optimizer cleanup. Change all to no longer use .data. Improve (b)float16 use with adabelief. Add XLA compatible Lars.
3 years ago
Ross Wightman 9541f4963b One more scalar -> tensor fix for lamb optimizer
3 years ago
Ross Wightman 8f68193c91
Update lamp.py comment
3 years ago
Ross Wightman 4d284017b8
Merge pull request #813 from rwightman/opt_cleanup
3 years ago
Ross Wightman a6af48be64 add madgradw optimizer
3 years ago
Ross Wightman 55fb5eedf6 Remove experiment from lamb impl
3 years ago
Ross Wightman 8a9eca5157 A few optimizer comments, dead import, missing import
3 years ago
Ross Wightman ac469b50da Optimizer improvements, additions, cleanup
3 years ago
Sepehr Sameni abf3e044bb
Update scheduler_factory.py
3 years ago
Ross Wightman 3cdaf5ed56 Add `mmax` config key to auto_augment for increasing upper bound of RandAugment magnitude beyond 10. Make AugMix uniform sampling default not override config setting.
3 years ago
Ross Wightman 1042b8a146 Add non fused LAMB optimizer option
3 years ago
Ross Wightman 01cb46a9a5 Add gc_efficientnetv2_rw_t weights (global context instead of SE attn). Add TF XL weights even though the fine-tuned ones don't validate that well. Change default arg for GlobalContext to use scal (mul) mode.
3 years ago
Ross Wightman d3f7440650 Add EfficientNetV2 XL model defs
3 years ago
Ross Wightman 72b227dcf5
Merge pull request #750 from drjinying/master
3 years ago
Ross Wightman 2907c1f967
Merge pull request #746 from samarth4149/master
3 years ago
Ross Wightman 748ab852ca Allow act_layer switch for xcit, fix in_chans for some variants
3 years ago
Ying Jin 20b2d4b69d Use bicubic interpolation in resize_pos_embed()
3 years ago
Ross Wightman d3255adf8e Merge branch 'xcit' of https://github.com/alexander-soare/pytorch-image-models into alexander-soare-xcit
3 years ago
Ross Wightman f8039c7492 Fix gc effv2 model cfg name
3 years ago
Alexander Soare 3a55a30ed1 add notes from author
3 years ago
Alexander Soare 899cf84ccc bug fix - missing _dist postfix for many of the 224_dist models
3 years ago
Alexander Soare 623e8b8eb8 wip xcit
3 years ago
Ross Wightman 392368e210 Add efficientnetv2_rw_t defs w/ weights, and gc variant, as well as gcresnet26ts for experiments. Version 0.4.13
3 years ago
samarth daab57a6d9 1. Added a simple multi step LR scheduler
3 years ago
Ross Wightman 6d8272e92c Add SAM pretrained model defs/weights for ViT B16 and B32 models.
3 years ago
Ross Wightman ee4d8fc69a Remove unecessary line from nest post refactor
3 years ago
Ross Wightman 8165cacd82 Realized LayerNorm2d won't work in all cases as is, fixed.
3 years ago
Ross Wightman 81cd6863c8 Move aggregation (convpool) for nest into NestLevel, cleanup and enable features_only use. Finalize weight url.
3 years ago
Ross Wightman 6ae0ac6420 Merge branch 'nested_transformer' of https://github.com/alexander-soare/pytorch-image-models into alexander-soare-nested_transformer
3 years ago
Alexander Soare 7b8a0017f1 wip to review
3 years ago
Alexander Soare b11d949a06 wip checkpoint with some feature extraction work
3 years ago
Alexander Soare 23bb72ce5e nested_transformer wip
3 years ago
Ross Wightman 766b4d3262 Fix features for resnetv2_50t
3 years ago
Ross Wightman e8045e712f Fix BatchNorm for ResNetV2 non GN models, add more ResNetV2 model defs for future experimentation, fix zero_init of last residual for pre-act.
3 years ago
Ross Wightman 20a2be14c3 Add gMLP-S weights, 79.6 top-1
3 years ago
Ross Wightman 85f894e03d Fix ViT in21k representation (pre_logits) layer handling across old and new npz checkpoints
3 years ago
Ross Wightman b41cffaa93 Fix a few issues loading pretrained vit/bit npz weights w/ num_classes=0 __init__ arg. Missed a few other small classifier handling detail on Mlp, GhostNet, Levit. Should fix #713
3 years ago
Ross Wightman 9c9755a808 AugReg release
3 years ago
Ross Wightman 381b279785 Add hybrid model fwds back
3 years ago
Ross Wightman 26f04a8e3e Fix a weight link
3 years ago
Ross Wightman 8f4a0222ed Add GMixer-24 MLP model weights, trained w/ TPU + PyTorch XLA
3 years ago
Ross Wightman 4c09a2f169 Bump version 0.4.12
3 years ago
Ross Wightman b319eb5b5d Update ViT weights, more details to be added before merge.
3 years ago
Ross Wightman 8257b86550 Fix up resnetv2 bit/bitm model default res
3 years ago
Ross Wightman 1228f5a3d8 Add BiT distilled 50x1 and teacher 152x2 models from 'A good teacher is patient and consistent' paper.
3 years ago
Ross Wightman 511a8e8c96 Add official ResMLP weights.
3 years ago
Ross Wightman b9cfb64412 Support npz custom load for vision transformer hybrid models. Add posembed rescale for npz load.
3 years ago
Ross Wightman 8319e0c373 Add file docstring to std_conv.py
3 years ago
Ross Wightman 4d96165989 Merge branch 'master' into cleanup_xla_model_fixes
3 years ago
Ross Wightman 8880f696b6 Refactoring, cleanup, improved test coverage.
3 years ago
Ross Wightman ba2ca4b464 One codepath for stdconv, switch layernorm to batchnorm so gain included. Tweak epsilon values for nfnet, resnetv2, vit hybrid.
3 years ago
Ross Wightman b7a568f065 Fix torchscript issue in bat
3 years ago
Ross Wightman d17b374f0f Minimum input_size needed to be higher
3 years ago
Ross Wightman b3b90d944d Add min_input_size to bat_resnext to prevent test breakage.
3 years ago
Ross Wightman d413eef1bf Add ResMLP-24 model weights that I trained in PyTorch XLA on TPU-VM. 79.2 top-1.
3 years ago
Ross Wightman 10d8fa4620 Add gc and bat attention resnext26ts variants to byob for test.
3 years ago
Ross Wightman 2f5ed2dec1 Update `init_values` const for 24 and 36 layer ResMLP models
3 years ago
Ross Wightman 8e4ac3549f All ScaledStdConv and StdConv uses default to using F.layernorm so that they work with PyTorch XLA. eps value tweaking is a WIP.
3 years ago
Ross Wightman 2a63d0246b Post merge cleanup
3 years ago
Ross Wightman 45dec179e5
Merge pull request #681 from lmk123568/master
3 years ago
Dongyoon Han ded1671483 Fix stochastic depth working only with a shortcut
3 years ago
Mike b87d98b238
Update convit.py
3 years ago
Ross Wightman 02320c3e3d Bump version to 0.4.11
3 years ago
Ross Wightman bda8ab015a Remove min channels for SelectiveKernel, divisor should cover cases well enough.
3 years ago
Ross Wightman a27f4aec4a Missed args for skresnext w/ refactoring.
3 years ago
Ross Wightman 307a935b79 Add non-local and BAT attention. Merge attn and self-attn factories into one. Add attention references to README. Add mlp 'mode' to ECA.
3 years ago
Ross Wightman 8bf63b6c6c Able to use other attn layer in EfficientNet now. Create test ECA + GC B0 configs. Make ECA more configurable.
3 years ago
Ross Wightman bcec14d3b5 Bring EfficientNet SE layer in line with others, pull se_ratio outside of blocks. Allows swapping w/ other attn layers.
3 years ago
Ross Wightman 9611458e19 Throw in some FBNetV3 code I had lying around, some refactoring of SE reduction channel calcs for all EffNet archs.
3 years ago
Ross Wightman 01b9108619 Merge branch 'master' into more_attn
3 years ago
Ross Wightman d7bab8a6c5 Fix strict flag change for checkpoint load.
3 years ago
Ross Wightman 02f9d4bc34 Add weights for resnet51q model, add 61q def.
3 years ago
Ross Wightman f615474be3 Fix broken test, repvgg block doesn't have attn_last attr.
3 years ago
Ross Wightman 742c2d5247 Add Gather-Excite and Global Context attn modules. Refactor existing SE-like attn for consistency and refactor byob/byoanet for less redundancy.
3 years ago
Ross Wightman 9c78de8c02 Fix #661, move hardswish out of default args for LeViT. Enable native torch support for hardswish, hardsigmoid, mish if present.
3 years ago
Ross Wightman 5db7452173 Fix visformer in_chans stem handling
3 years ago
Ross Wightman 318360c3f9 Update README.md before merge. Bump version to 0.4.10
3 years ago
Ross Wightman 11ae795e99 Redo LeViT attention bias caching in a way that works with both torchscript and DataParallel
3 years ago
Ross Wightman d400f1dbdd Filter test models before creation for backward/torchscript tests
3 years ago
Ross Wightman c4572cc5aa Add Visformer-small weighs, tweak torchscript jit test img size.
3 years ago
Ross Wightman bfc72f75d3 Expand scope of testing for non-std vision transformer / mlp models. Some related cleanup and create fn cleanup for all vision transformer and mlp models. More CoaT weights.
3 years ago
Ross Wightman 18bf520ad1 Add eca_nfnet_l2/l3 defs for future training
3 years ago
Ross Wightman f45de37690 Merge branch 'master' into levit_visformer_rednet
3 years ago
Ross Wightman 23c18a33e4 Add efficientnetv2_rw_m weights trained in PyTorch. 84.8 top-1 @ 416 test. 53M params.
3 years ago
Ross Wightman c2ba229d99 Prep for effcientnetv2_rw_m model weights that started training before official release..
3 years ago
Ross Wightman 30b9880d06 Minor adjustment, mutable default arg, extra check of valid len...
3 years ago
Ross Wightman be0abfbcce Merge branch 'master' of https://github.com/alexander-soare/pytorch-image-models into alexander-soare-master
3 years ago
Ross Wightman b7de82e835 ConViT cleanup, fix torchscript, bit of reformatting, reuse existing layers.
3 years ago
Ross Wightman 306c86b668 Merge branch 'convit' of https://github.com/amaarora/pytorch-image-models into amaarora-convit
3 years ago
Ross Wightman a569635045 Update twin weights to a copy in GitHub releases for faster dl. Tweak model class comment.
3 years ago
Ross Wightman be99eef9c1 Remove redundant code, cleanup, fix torchscript.
3 years ago
Ross Wightman 5ab372a3ec Merge branch 'master' of https://github.com/abcdvzz/pytorch-image-models into abcdvzz-master
3 years ago
Aman Arora 5db1eb6ba5 Add defaults
3 years ago
Aman Arora 8b1f2e8e1f remote unused matplotlib import
3 years ago
Aman Arora 40c506ba1e Add ConViT
3 years ago
Alexander Soare 7976019864 extend positional embedding resizing functionality to tnt
3 years ago
Alexander Soare 8086943b6f allow resize positional embeddings to non-square grid
3 years ago
talrid dc1a4efd28 mixer_b16_224_miil, mixer_b16_224_miil_in21k models
3 years ago
李鑫杰 7b799c4e79 add latest code
3 years ago
Ross Wightman d5af752117 Add preliminary gMLP and ResMLP impl to Mlp-Mixer
3 years ago
李鑫杰 00548b8427 Add Twins
3 years ago
Ross Wightman e7f0db8664 Fix drop/drop_path arg on MLP-Mixer model. Fix #641
3 years ago
Ross Wightman 9a3ae97311 Another set of byoanet models w/ ECA channel + SA + groups
3 years ago
Ross Wightman d53e91218e Fix tf.data options setting for newer TF versions
3 years ago
Ross Wightman 7077f16c6a Change 21k model naming from _21k to _in21k for consistency with existing 21k models.
3 years ago
Ross Wightman 94d4b53352 Add temporary default_cfgs to visformer models so they pass tests
3 years ago
Ross Wightman 3bffc701f1 Merge branch 'master' into levit_visformer_rednet
3 years ago
Ross Wightman ecc7552c5c Add levit, levit_c, and visformer model defs. Largely untested and not finished cleanup.
3 years ago
Ross Wightman 165fb354b2 Add initial RedNet model / Involution layer impl for testing
3 years ago
Ross Wightman 328249f11a Update README, tweak fine-tune effv2 model names.
3 years ago
Ross Wightman c4f482a08b EfficientNetV2 official impl w/ weights ported from TF. Cleanup/refactor of related EfficientNet classes and models.
3 years ago
Ross Wightman 4fbc32d3d0 Fix crop_pct for cait models.
3 years ago
Ross Wightman 715519a5ef Rethink name of patch embed grid info
3 years ago
Ross Wightman b2c305c2aa Move Mlp and PatchEmbed modules into layers. Being used in lots of models now...
3 years ago
Ross Wightman 3ba6b55cb2 More adjustments to ByoaNet models for further experiments.
3 years ago
Ross Wightman 5fcddb96a8 Merge branch 'master' into cait
3 years ago
Ross Wightman 3db12b4b6a Finish CaiT cleanup
3 years ago
Ross Wightman 2d8b09fe8b Add official pretrained weights to MLP-Mixer, complete model cfgs.
3 years ago
Ross Wightman 12efffa6b1 Initial MLP-Mixer attempt...
3 years ago
Ross Wightman 0721559511 Improved (hopefully) init for SA/SA-like layers used in ByoaNets
3 years ago
Ross Wightman d5473c17f7 Fix incorrect name of shortcut/identity paths in many residual nets. Inherited from naming in old old torchvision, long fixed there.
3 years ago
Ross Wightman 0d87650fea Remove filter hack from BlurPool w/ non-persistent buffer. Use BlurPool2d instead of AntiAliasing.. for TResNet. Breaks PyTorch < 1.6.
3 years ago
Ross Wightman ddc743fdf8 Update ResNet-RS models to EMA weights
3 years ago
Ross Wightman 08d60f4a9a resnetrs50 pool sizing wrong
3 years ago
Ross Wightman 1daa15ecc3 Initial Cait commit. Still some cleanup to do.
3 years ago
Ross Wightman 67d0665b46 Post ResNet-RS merge cleanup. Add weight urls, adjust train/test/crop pct.
3 years ago
Aman Arora 560eae38f5
[WIP] Add ResNet-RS models (#554)
3 years ago
Ross Wightman 9cc7dda6e5 Fixup byoanet configs to pass unit tests. Add swin_attn and swinnet26t model for testing.
3 years ago
Ross Wightman e15c3886ba Defaul lambda r=7. Define '26t' stage 4/5 256x256 variants for all of bot/halo/lambda nets for experiment. Add resnet50t for exp. Fix a few comments.
3 years ago
Ross Wightman e5e15754c9 Fix coat first conv ident
3 years ago
Ross Wightman 76739a7589 CoaT merge. Bit of formatting, fix torchscript (for non features), remove einops/einsum dep, add pretrained weight hub (url) support.
3 years ago
Ross Wightman 026430c083 Merge branch 'master' of https://github.com/morizin/pytorch-image-models-1 into morizin-master
3 years ago
Ross Wightman a0492e3b48 A few miil weights naming tweaks to improve compat with model registry and filtering wildcards.
3 years ago
talrid 8c1f03e56c comment
3 years ago
talrid 19e1b67a84 old spaces
3 years ago
talrid a443865876 update naming and scores
3 years ago
talrid cf0e371594 84_0
3 years ago
talrid 0968bdeca3 vit, tresnet and mobilenetV3 ImageNet-21K-P weights
3 years ago
morizin 1e3b6d4dfc
Update __init__.py
3 years ago
morizin fd022fd6a2
Update __init__.py
3 years ago
morizin c2d5087eae
Add files via upload
3 years ago
Norman Mu 79640fcc1f Enable uniform augmentation magnitude sampling and set AugMix default
3 years ago
Ross Wightman c1cf9712fc Add updated EfficientNet-V2S weights, 83.8 @ 384x384 test. Add PyTorch trained EfficientNet-B4 weights, 83.4 @ 384x384 test. Tweak non TF EfficientNet B1-B4 train/test res scaling.
3 years ago
Ross Wightman e8a64fb881 Test input size for efficientnet_v2s was wrong in last results run
3 years ago
Ross Wightman 2df77ee5cb Fix torchscript compat and features_only behaviour in GhostNet PR. A few minor formatting changes. Reuse existing layers.
3 years ago
Ross Wightman d793deb51a Merge branch 'master' of https://github.com/iamhankai/pytorch-image-models into iamhankai-master
3 years ago
Ross Wightman e685618f45
Merge pull request #550 from amaarora/wandb
3 years ago
Ross Wightman f606c45c38 Add Swin Transformer models from https://github.com/microsoft/Swin-Transformer
3 years ago
iamhankai de445e7827 Add GhostNet
3 years ago
Ross Wightman 5a196dddf6 Update README.md with latest, bump version to 0.4.8
3 years ago
Ross Wightman b3d7580df1 Update ByoaNet comments. Fix first Steam feat chs for ByobNet.
3 years ago
Ross Wightman 16f7aa9f54 Add default_cfg options for min_input_size / fixed_input_size, queries in model registry, and use for testing self-attn models
3 years ago
Ross Wightman 4e4b863b15 Missed norm.py
3 years ago
Ross Wightman 7c97e66f7c Remove commented code, add more consistent seed fn
3 years ago
Ross Wightman 364dd6a58e Merge branch 'master' into byoanet-self_attn
3 years ago
Ross Wightman ce62f96d4d ByoaNet with bottleneck transformer, lambda resnet, and halo net experiments
3 years ago
Ross Wightman cd3dc4979f Fix adabelief imports, remove prints, preserve memory format is the default arg for zeros_like
3 years ago
Ross Wightman 21812d33aa Add prelim efficientnet_v2s weights from 224x224 train, eval 83.3 @ 288. Add eca_nfnet_l1 weights, train at 256, eval 84 @ 320.
3 years ago
Aman Arora 5772c55c57 Make wandb optional
3 years ago
Aman Arora f54897cc0b make wandb not required but rather optional as huggingface_hub
3 years ago
Aman Arora 3f028ebc0f import wandb in summary.py
4 years ago
Aman Arora 624c9b6949 log to wandb only if using using wandb
4 years ago
juntang addfc7c1ac adabelief
4 years ago
Ross Wightman fb896c0b26 Update some comments re preliminary EfficientNet-V2 assumptions
4 years ago
Ross Wightman 2b49ab7a36 Fix ResNetV2 pretrained classifier issue. Fixes #540
4 years ago
Ross Wightman de9dff933a EfficientNet-V2S preliminary model def (for experimentation)
4 years ago
Ross Wightman 37c71a5609 Some further create_optimizer_v2 tweaks, remove some redudnant code, add back safe model str. Benchmark step times per batch.
4 years ago
Ross Wightman 2bb65bd875 Wrong default_cfg pool_size for L1
4 years ago
Ross Wightman bf2ca6bdf4 Merge jax and original weight init
4 years ago
Ross Wightman acbd698c83 Update README.md with updates. Small tweak to head_dist handling.
4 years ago
Ross Wightman 9071568f0e Add weights for SE NFNet-L0 model, rename nfnet_l0b -> nfnet_l0. 82.75 top-1 @ 288. Add nfnet_l1 model def for training.
4 years ago
Ross Wightman c468c47a9c Add regnety_160 weights from DeiT teacher model, update that and my regnety_032 weights to use higher test size.
4 years ago
Ross Wightman 288682796f Update benchmark script to add precision arg. Fix some downstream (DeiT) compat issues with latest changes. Bump version to 0.4.7
4 years ago
Ross Wightman ea9c9550b2 Fully move ViT hybrids to their own file, including embedding module. Remove some extra DeiT models that were for benchmarking only.
4 years ago
Ross Wightman a5310a3451 Merge remote-tracking branch 'origin/benchmark-fixes-vit_hybrids' into pit_and_vit_update
4 years ago
Ross Wightman 7953e5d11a Fix pos_embed scaling for ViT and num_classes != 1000 for pretrained distilled deit and pit models. Fix #426 and fix #433
4 years ago
Ross Wightman a760a4c3f4 Some ViT cleanup, merge distilled model with main, fixup torchscript support for distilled models
4 years ago
Ross Wightman 0dfc5a66bb Add PiT model from https://github.com/naver-ai/pit
4 years ago
Ross Wightman 51febd869b Small tweak to tests for tnt model, reorder model imports.
4 years ago
Ross Wightman b27a4e0d88 Merge branch 'master' of https://github.com/contrastive/pytorch-image-models into contrastive-master
4 years ago
Aman Arora 6b18061773 Add GIST to docstring for quick access
4 years ago
contrastive de86314655 Update TNT
4 years ago
Aman Arora 92b1db9a79 update docstrings and add check on and
4 years ago
Aman Arora b85be24054 update to work with fnmatch
4 years ago
contrastive cfc15283a4 Update TNT url
4 years ago
contrastive 4a09bc851e Add TNT model
4 years ago
Aman Arora 20626e8387 Add to extract stats for SPP
4 years ago
Ross Wightman cf5fec5047 Cleanup experimental vit weight init a bit
4 years ago
Ross Wightman f42f1df26c Improve evenness of per-worker split for validation set with TFDS
4 years ago
Ross Wightman cbcb76d72c Should have included Conv2d layers in original weight init. Lets see what the impact is...
4 years ago
Ross Wightman 4de57ccf01 Add weight init scheme that's closer to JAX impl
4 years ago
Ross Wightman 14ac4abf74 Change huggingface hub revision delimiter to '@', add hf_hub reference for eca_nfnet_l0 model as an example.
4 years ago
Ross Wightman 740f32c96a Add ECA-NFNet-L0 weights and update model name. Update README and bump version to 0.4.6
4 years ago
Ross Wightman 5e2e4e7fb6
Merge pull request #502 from khatami-mehrdad/efficientnet_dg
4 years ago
Ross Wightman 45c048ba13 A few minor fixes and bit more cleanup on the huggingface hub integration.
4 years ago
Mehrdad 97f49d163c adding efficientnet_el, efficientnet_es_prunedefficientnet_es_pruned and efficientnet_el_pruned pretrained models to timm/models
4 years ago
Ross Wightman ead80d33c5 Fix typo, naming consistency
4 years ago
Ross Wightman d584e7f617 Support for huggingface hub via create_model and default_cfgs.
4 years ago
Ross Wightman 1f799afcb3 Remove capital letters in model names
4 years ago
AFLALO, Jonathan Isaac 82c1fe958a added hardcorenas model
4 years ago
Ross Wightman f57db99101 Update README, fix iabn pip version print.
4 years ago
Ross Wightman 7ef7788ee9 Fix CUDA crash w/ channels-last + CSP models. Remove use of chunk()
4 years ago
Ross Wightman 317ea3e599 Tweak NFNet, ResNetV2, RexNet feature extraction, use pre-act features for simplicty. Update tests for pytorch 1.8. 0.4.5 version bump.
4 years ago
Ross Wightman 17cdee7354 Fix C&P patch_size error, and order of op patch_size arg resolution bug. Remove a test vit model.
4 years ago
Ross Wightman 0706d05d52 Benchmark models listed in txt file. Add more hybrid vit variants for testing
4 years ago
Ross Wightman 2db2d87ff7 Add epoch-repeats arg to multiply the number of dataset passes per epoch. Currently for iterable datasets (read TFDS wrapper) only.
4 years ago
Ross Wightman de97be9146 Spell out diff between my small and deit small vit models.
4 years ago
Ross Wightman f0ffdf89b3 Add numerous experimental ViT Hybrid models w/ ResNetV2 base. Update the ViT naming for hybrids. Fix #426 for pretrained vit resizing.
4 years ago
Ross Wightman 0e16d4e9fb Add benchmark.py script, and update optimizer factory to be more friendly to use outside of argparse interface.
4 years ago
Ross Wightman 4bc103f504 Fix CUDA crash w/ channels-last + CSP models. Remove use of chunk()
4 years ago
Ross Wightman 8563609b28 Update notes in ScaledStdConv impl
4 years ago
Ross Wightman 678ba4e0a2 Add NFNet-F model weights ported from DeepMind Haiku impl and new set of models w/ compatible config.
4 years ago
Ross Wightman 9de2ec5e44 Update README for AGC and bump version to 0.4.4
4 years ago
Ross Wightman 4f49b94311 Initial AGC impl. Still testing.
4 years ago
Ross Wightman 5f9aff395c Fix stem width in NFNet-F models, add some more comments, add some 'light' NFNet models for testing.
4 years ago
Ross Wightman d86dbe45c2 Update README.md and few more comments
4 years ago
Ross Wightman 0d253e2c5e Fix issue with nfnet tests, bit more cleanup.
4 years ago
Ross Wightman cb06c7a910 Add NFNet-F models and tweak existing NF models.
4 years ago
Ross Wightman e4de077021 Add first 'Normalizer Free' models. nf_regnet_b1 79.3 @ 288x288 test, and nf_resnet50 80.3 @ 256x256 test (80.68 @ 288x288).
4 years ago
Ross Wightman d8e69206be
Merge pull request #419 from rwightman/byob_vgg_models
4 years ago
Ross Wightman ca9b078ac7 Update README.md and docs. Version bumped to 0.4.3
4 years ago
Ross Wightman 6853b07bbd Improve RegVGG block identity/vs non for clariy and fix attn usage. Add comments.
4 years ago
Ross Wightman 0356e773f5 Default to native PyTorch AMP instead of APEX amp. Too many APEX issues cropping up lately.
4 years ago
Reuben 94ca140b67 update collections.abc import
4 years ago
Ross Wightman b4e216e377 Fix a few small things.
4 years ago
Ross Wightman dc85e5a237 Add ByobNet w/ GPU-EfficientNets and RepVGG. Also add classic vgg models.
4 years ago
Ross Wightman 1bcc69e0ad Use in_channels for depthwise groups, allows using `out_channels=N * in_channels` (does not impact existing models). Fix #354.
4 years ago
Ross Wightman 9811e229f7 Fix regression in models with 1001 class pretrained weights. Improve batchnorm arg and BatchNormAct layer handling in several models.
4 years ago
Ross Wightman a39c3ee216
Merge branch 'master' into eca-weights
4 years ago
Ross Wightman e9d6fe293c Update README for new weights. Version 0.4.2
4 years ago
Ross Wightman 666de85cf1 Move stride in EdgeResidual block to 3x3 expansion conv. Fix #414
4 years ago
Ross Wightman 3b57490a63 Fix some half removed resnet model defs, pooling for ecaresnet269d
4 years ago
Ross Wightman 68a4144882 Add new weights for ecaresnet26t/50t/269d models. Remove distinction between 't' and 'tn' (tiered models), tn is now t. Add test time img size spec to default cfg.
4 years ago
Ross Wightman b9843f954b
Merge pull request #282 from tigert1998/patch-1
4 years ago
hwangdeyu 7a4be5c035 add operator HardSwishJitAutoFn export to onnx
4 years ago
Ross Wightman 4203efa36d Fix #387 so that checkpoint saver works with max history of 1. Add checkpoint-hist arg to train.py.
4 years ago
Ross Wightman f0e65e37b7 Fix NF-ResNet101 model defs
4 years ago
Ross Wightman 2c988c3b6e Update README.md for NF-nets, bump version to 0.4.1 for merge
4 years ago
Ross Wightman 2de54d174a Fix pool size defs for NFNet models, add a comment.
4 years ago
Ross Wightman 90980de4a9 Fix up a few details in NFResNet models, managed stable training. Add support for gamma gain to be applied in activation or ScaleStdConv. Some tweaks to ScaledStdConv.
4 years ago
Ross Wightman 5a8e1e643e Initial Normalizer-Free Reg/ResNet impl. A bit of related layer refactoring.
4 years ago
Ross Wightman 38d8f67570 Fix potential issue with change to num_classes arg in train/validate.py defaulting to None (rely on model def / default_cfg)
4 years ago
Ross Wightman 587780e56b Update README.md and bump version to 0.4.0
4 years ago
Ross Wightman bb50ac4708 Add DeiT distilled weights and distilled model def. Remove some redudant ViT model args.
4 years ago
Ross Wightman c16e965037 Add some ViT comments and fix a few minor issues.
4 years ago
Ross Wightman 22748f1a2d Convert samples/targets in ParserImageInTar to numpy arrays, slightly less mem usage for massive datasets. Add a few more se/eca model defs to resnet.py
4 years ago
Ross Wightman 5d4c3d0af3 Add enhanced ParserImageInTar that can read images from tars within tars, folders with multiple tars, etc. Additional comment cleanup.
4 years ago
Ross Wightman 55f7dfa9ea Refactor vision_transformer entrpy fns, add pos embedding resize support for fine tuning, add some deit models for testing
4 years ago
Ross Wightman d55bcc0fee Finishing adding stochastic depth support to BiT ResNetV2 models
4 years ago
Ross Wightman 855d6cc217 More dataset work including factories and a tensorflow datasets (TFDS) wrapper
4 years ago
Ross Wightman 20516abc18 Fix some broken tests for ResNetV2 BiT models
4 years ago
Ross Wightman 59ec7e6a53 Merge branch 'master' into imagenet21k_datasets_more
4 years ago
Ross Wightman e7a9ddf982
Merge pull request #334 from kecsap/links
4 years ago
Csaba Kertesz 7cae7e7035 Follow links during dataset scanning
4 years ago
Ross Wightman c96e9f99a0 Update version to 0.3.3
4 years ago
Ross Wightman 4e2533db77 Add 320x320 model default cfgs for 101D and 152D ResNets. Add SEResNet-152D weights and 320x320 cfg.
4 years ago
Ross Wightman 0167f749d3 Remove some old __future__ imports
4 years ago
Ross Wightman e35e9760a6 More work on dataset / parser split and imagenet21k (tar) support
4 years ago
Ross Wightman ce69de70d3 Add 21k weight urls to vision_transformer. Cleanup feature_info for preact ResNetV2 (BiT) models
4 years ago
Ross Wightman 231d04e91a ResNetV2 pre-act and non-preact model, w/ BiT pretrained weights and support for ViT R50 model. Tweaks for in21k num_classes passing. More to do... tests failing.
4 years ago
Ross Wightman de6046e213 Initial commit for dataset / parser reorg to support additional datasets / types
4 years ago
Ross Wightman 392595c7eb Add pool_size to default cfgs for new models to prevent tests from failing. Add explicit 200D_320 model entrypoint for next benchmark run.
4 years ago
Ross Wightman b1f1228a41 Add ResNet101D, 152D, and 200D weights, remove meh 66d model
4 years ago
Jasha 7c56c718f3 Configure create_optimizer with args.opt_args
4 years ago
Ross Wightman 9a25fdf3ad
Merge pull request #297 from rwightman/ema_simplify
4 years ago
Tymoteusz Wiśniewski de15b43865 Fix a bug with accuracy retrieving from RealLabels
4 years ago
Ross Wightman cd72e66eff Bug in last mod for features_only default_cfg
4 years ago
Ross Wightman 867a0e5a04 Add default_cfg back to models wrapped in feature extraction module as per discussion in #294.
4 years ago
Ross Wightman 4ca52d73d8 Add separate set and update method to ModelEmaV2
4 years ago
Ross Wightman 2ed8f24715 A few more changes for 0.3.2 maint release. Linear layer change for mobilenetv3 and inception_v3, support no bias for linear wrapper.
4 years ago
Ross Wightman 6504a42832 Version 0.3.2
4 years ago
Ross Wightman 460eba7f24 Work around casting issue with combination of native torch AMP and torchscript for Linear layers
4 years ago
Ross Wightman 5f4b6076d8 Fix inplace arg compat for GELU and PreLU via activation factory
4 years ago
Ross Wightman fd962c4b4a Native SiLU (Swish) op doesn't export to ONNX
4 years ago
Ross Wightman 27bbc70d71 Add back old ModelEma and rename new one to ModelEmaV2 to avoid compat breaks in dependant code. Shuffle train script, add a few comments, remove DataParallel support, support experimental torchscript training.
4 years ago
tigertang 43f2500c26
Add symbolic for SwishJitAutoFn to support onnx
4 years ago
Ross Wightman 9214ca0716 Simplifying EMA...
4 years ago
Ross Wightman 53aeed3499 ver 0.3.1
4 years ago
Ross Wightman 30ab4a1494 Fix issue in optim factory with sgd / eps flag. Bump version to 0.3.1
4 years ago
Ross Wightman 741572dc9d Bump version to 0.3.0 for pending PyPi push
4 years ago
Ross Wightman b401952caf Add newly added vision transformer large/base 224x224 weights ported from JAX official repo
4 years ago
Ross Wightman 61200db0ab in_chans=1 working w/ pretrained weights for vision_transformer
4 years ago
Ross Wightman e90edce438 Support native silu activation (aka swish). An optimized ver is available in PyTorch 1.7.
4 years ago
Ross Wightman da6cd2cc1f Fix regression for pretrained classifier loading when using entrypt functions directly
4 years ago
Ross Wightman f591e90b0d Make sure num_features attr is present in vit models as with others
4 years ago
Ross Wightman 4a3df7842a Fix topn metric view regression on PyTorch 1.7
4 years ago
Ross Wightman f944242cb0 Fix #262, num_classes arg mixup. Make vision_transformers a bit closer to other models wrt get/reset classfier/forward_features. Fix torchscript for ViT.
4 years ago
Ross Wightman 736f209e7d Update vision transformers to be compatible with official code. Port official ViT weights from jax impl.
4 years ago
Ross Wightman 477a78ed81 Fix optimizer factory regressin for optimizers like sgd/momentum that don't have an eps arg
4 years ago
Ross Wightman 27a93e9de7 Improve test crop for ViT models. Small now 77.85, added base weights at 79.35 top-1.
4 years ago
Ross Wightman d4db9e7977 Add small vision transformer weights. 77.42 top-1.
4 years ago
talrid 27fadaa922 asymmetric_loss
4 years ago
Ross Wightman f31933cb37 Initial Vision Transformer impl w/ patch and hybrid variants. Refactor tuple helpers.
4 years ago
Ross Wightman a4d8fea61e Add model based wd skip support. Improve cross version compat of optimizer factory. Fix #247
4 years ago
Ross Wightman 80078c47bb Add Adafactor and Adahessian optimizers, cleanup optimizer arg passing, add gradient clipping support.
4 years ago
Ross Wightman fcb6258877 Add missing leaky_relu layer factory defn, update Apex/Native loss scaler interfaces to support unscaled grad clipping. Bump ver to 0.2.2 for pending release.
4 years ago
Ross Wightman e8e2d9cabf Add DropPath (stochastic depth) to ReXNet and VoVNet. RegNet DropPath impl tweak and dedupe se args.
4 years ago
Ross Wightman e8ca45854c More models in sotabench, more control over sotabench run, dataset filename extraction consistency
4 years ago
Ross Wightman 9c406532bd Add EfficientNet-EdgeTPU-M (efficientnet_em) model trained natively in PyTorch. More sotabench fiddling.
4 years ago
Ross Wightman c40384f5bd Add ResNet weights. 80.5 (top-1) ResNet-50-D, 77.1 ResNet-34-D, 72.7 ResNet-18-D.
4 years ago
Ross Wightman 47a7b3b5b1 More flexible mixup mode, add 'half' mode.
4 years ago
Ross Wightman 532e3b417d Reorg of utils into separate modules
4 years ago
Ross Wightman 33f8a1bf36 Updated README, add wide_resnet50_2 and seresnext50_32x4d weights
4 years ago
Ross Wightman 751b0bba98 Add global_pool (--gp) arg changes to allow passing 'fast' easily for train/validate to avoid channels_last issue with AdaptiveAvgPool
4 years ago
Ross Wightman 9c297ec67d Cleanup Apex vs native AMP scaler state save/load. Cleanup CheckpointSaver a bit.
4 years ago
Ross Wightman 80c9d9cc72 Add 'fast' global pool option, remove redundant SEModule from tresnet, normal one is now 'fast'
4 years ago
Ross Wightman 90a01f47d1 hrnet features_only pretrained weight loading issue. Fix #232.
4 years ago
Ross Wightman 110a7c4982 AdaptiveAvgPool2d -> mean((2,3)) for all SE/attn layers to avoid NaN with AMP + channels_last layout. See https://github.com/pytorch/pytorch/issues/43992
4 years ago
Ross Wightman c2cd1a332e Improve torch amp support and add channels_last support for train/validate scripts
4 years ago
Ross Wightman 470220b1f4 Fix MobileNetV3 crash with global_pool='', output consistent with other models but not equivalent due to efficient head.
4 years ago
Ross Wightman fc8b8afb6f Fix a silly bug in Sample version of EvoNorm missing x* part of swish, update EvoNormBatch to accumulated unbiased variance.
4 years ago
Ross Wightman f614df7921 Bump version to 0.2.1 and update README
4 years ago
Ross Wightman b423bc8362
Merge pull request #218 from rwightman/cutmix
4 years ago
Ross Wightman 8c9814e3f5 Final cleanup of mixup/cutmix. Element/batch modes working with both collate (prefetcher active) and without prefetcher.
4 years ago
Ross Wightman 0f5d9d8166 Add CSPResNet50 weights, 79.6 top-1 at 256x256
4 years ago
Ross Wightman b1b6e7c361 Fix a few more issues related to #216 w/ TResNet (space2depth) and FP16 weights in wide resnets. Also don't completely dump pretrained weights in in_chans != 1 or 3 cases.
4 years ago
Ross Wightman 512b2dd645 Add new EfficientNet-B3 and RegNetY-3.2GF weights, both just over 82 top-1
4 years ago
Ross Wightman 6890300877 Add DropPath (stochastic depth) to RegNet
4 years ago
Ross Wightman cd23f55397 Fix mixed prec issues with new mixup code
4 years ago
Yusuke Uchida f6b56602f9 fix test_model_default_cfgs
4 years ago
Ross Wightman f471c17c9d More cutmix/mixup overhaul, ready to kick-off some trials.
4 years ago
Ross Wightman d5145fa4d5 Change default_cfg names for senet to include the legacy and match model names
4 years ago
Ross Wightman 92f2d0d65d Merge branch 'master' into cutmix. Fixup a few issues.
4 years ago
Ross Wightman 1696499ce5 Bump version to 0.2.0, ready to roll (I think)
4 years ago
Ross Wightman e62758cf4f More documentation updates, fix a typo
4 years ago
Ross Wightman dfe80414a6 Add bool arg helper
4 years ago
Ross Wightman fa28067704 Add more augmentation arguments, including a no_aug disable flag. Fix #209
4 years ago
Ross Wightman b1f1a54de9 More uniform treatment of classifiers across all models, reduce code duplication.
4 years ago
Ross Wightman d72ddafe56 Fix some checkpoint / model str regressions
4 years ago
Ross Wightman ac18adb9c3 Remove debug print from RexNet
4 years ago
Ross Wightman c53ec33ae0 Add synset/label indices for results generation. Add 'valid labels' to validation script to support imagenet-a/r label subsets properly.
4 years ago
Ross Wightman ec4976fdba Add EfficientNet-Lite0 weights trained with this code by @hal-314, 75.484 top-1
4 years ago
Ross Wightman 9ecd16bd7b Add new seresnet50 (non-legacy) model weights, 80.274 top-1
4 years ago
Ross Wightman 7995295968 Merge branch 'logger' into features. Change 'logger' to '_logger'.
4 years ago
Ross Wightman 1998bd3180 Merge branch 'feature/AB/logger' of https://github.com/antoinebrl/pytorch-image-models into logger
4 years ago
Ross Wightman 6c17d57a2c Fix some attributions, add copyrights to some file docstrings
4 years ago
Ross Wightman a69c0e04f0 Fix pool size in cspnet
4 years ago
Ross Wightman 14ef7a0dd6 Rename csp.py -> cspnet.py
4 years ago
Ross Wightman ec37008432 Add pretrained weight links to CSPNet for cspdarknet53, cspresnext50
4 years ago
Sangdoo Yun e93e571f7a Add `adamp` and 'sgdp' optimizers.
4 years ago
Ross Wightman 08016e839d Cleanup FeatureInfo getters, add TF models sourced Xception41/65/71 weights
4 years ago
Ross Wightman 7ba5a384d3 Add ReXNet w/ remapped weights, feature support
4 years ago
Ross Wightman c9d54bc1c3 Add HRNet feature extraction, fix senet type, lower feature testing res to 96x96
4 years ago
Ross Wightman 2ac663f340 Add feature support to legacy senets, add 32x32 resnext models to exclude list for feature testing.
4 years ago
Ross Wightman c146b54abc Cleanup EfficientNet/MobileNetV3 feature extraction a bit, only two tap locations now, small mobilenetv3 models work
4 years ago
Ross Wightman 68fd8a267b Merge branch 'master' into features
4 years ago
Ross Wightman 4e61c6a12d Cleanup, refactoring of Feature extraction code, add tests, fix tests, non hook feature extraction working with torchscript
4 years ago
Ross Wightman 6eec3fb4a4 Move FeatureHooks into features.py, switch EfficientNet, MobileNetV3 to use build model helper
4 years ago
Ross Wightman 9eba134d79 More models supporting feature extraction, xception, gluon_xception, inception_v3, inception_v4, pnasnet, nasnet, dla. Fix DLA unused projection params.
4 years ago
Ross Wightman 298fba09ac Back out some activation hacks trialing upcoming pytorch changes
4 years ago