Ross Wightman
ac469b50da
Optimizer improvements, additions, cleanup
* Add MADGRAD code
* Fix Lamb (non-fused variant) to work w/ PyTorch XLA
* Tweak optimizer factory args (lr/learning_rate and opt/optimizer_name), may break compat
* Use newer fn signatures for all add,addcdiv, addcmul in optimizers
* Use upcoming PyTorch native Nadam if it's available
* Cleanup lookahead opt
* Add optimizer tests
* Remove novograd.py impl as it was messy, keep nvnovograd
* Make AdamP/SGDP work in channels_last layout
* Add rectified adabelief mode (radabelief)
* Support a few more PyTorch optim, adamax, adagrad
3 years ago
Sepehr Sameni
abf3e044bb
Update scheduler_factory.py
remove duplicate code from create_scheduler()
3 years ago
Ross Wightman
3cdaf5ed56
Add `mmax` config key to auto_augment for increasing upper bound of RandAugment magnitude beyond 10. Make AugMix uniform sampling default not override config setting.
3 years ago
Ross Wightman
1042b8a146
Add non fused LAMB optimizer option
3 years ago
Ross Wightman
01cb46a9a5
Add gc_efficientnetv2_rw_t weights (global context instead of SE attn). Add TF XL weights even though the fine-tuned ones don't validate that well. Change default arg for GlobalContext to use scale (mul) mode.
3 years ago
Ross Wightman
d3f7440650
Add EfficientNetV2 XL model defs
3 years ago
Ross Wightman
72b227dcf5
Merge pull request #750 from drjinying/master
Specify "interpolation" mode in vision_transformer's resize_pos_embed
3 years ago
Ross Wightman
2907c1f967
Merge pull request #746 from samarth4149/master
Adding a Multi Step LR Scheduler
3 years ago
Ross Wightman
748ab852ca
Allow act_layer switch for xcit, fix in_chans for some variants
3 years ago
Ying Jin
20b2d4b69d
Use bicubic interpolation in resize_pos_embed()
3 years ago
Ross Wightman
d3255adf8e
Merge branch 'xcit' of https://github.com/alexander-soare/pytorch-image-models into alexander-soare-xcit
3 years ago
Ross Wightman
f8039c7492
Fix gc effv2 model cfg name
3 years ago
Alexander Soare
3a55a30ed1
add notes from author
3 years ago
Alexander Soare
899cf84ccc
bug fix - missing _dist postfix for many of the 224_dist models
3 years ago
Alexander Soare
623e8b8eb8
wip xcit
3 years ago
Ross Wightman
392368e210
Add efficientnetv2_rw_t defs w/ weights, and gc variant, as well as gcresnet26ts for experiments. Version 0.4.13
3 years ago
samarth
daab57a6d9
1. Added a simple multi step LR scheduler
3 years ago
Ross Wightman
6d8272e92c
Add SAM pretrained model defs/weights for ViT B16 and B32 models.
3 years ago
Ross Wightman
ee4d8fc69a
Remove unnecessary line from nest post refactor
3 years ago
Ross Wightman
8165cacd82
Realized LayerNorm2d won't work in all cases as is, fixed.
3 years ago
Ross Wightman
81cd6863c8
Move aggregation (convpool) for nest into NestLevel, cleanup and enable features_only use. Finalize weight url.
3 years ago
Ross Wightman
6ae0ac6420
Merge branch 'nested_transformer' of https://github.com/alexander-soare/pytorch-image-models into alexander-soare-nested_transformer
3 years ago
Alexander Soare
7b8a0017f1
wip to review
3 years ago
Alexander Soare
b11d949a06
wip checkpoint with some feature extraction work
3 years ago
Alexander Soare
23bb72ce5e
nested_transformer wip
3 years ago
Ross Wightman
766b4d3262
Fix features for resnetv2_50t
3 years ago
Ross Wightman
e8045e712f
Fix BatchNorm for ResNetV2 non GN models, add more ResNetV2 model defs for future experimentation, fix zero_init of last residual for pre-act.
3 years ago
Ross Wightman
20a2be14c3
Add gMLP-S weights, 79.6 top-1
3 years ago
Ross Wightman
85f894e03d
Fix ViT in21k representation (pre_logits) layer handling across old and new npz checkpoints
3 years ago
Ross Wightman
b41cffaa93
Fix a few issues loading pretrained vit/bit npz weights w/ num_classes=0 __init__ arg. Missed a few other small classifier handling details on Mlp, GhostNet, Levit. Should fix #713
3 years ago
Ross Wightman
9c9755a808
AugReg release
3 years ago
Ross Wightman
381b279785
Add hybrid model fwds back
3 years ago
Ross Wightman
26f04a8e3e
Fix a weight link
3 years ago
Ross Wightman
8f4a0222ed
Add GMixer-24 MLP model weights, trained w/ TPU + PyTorch XLA
3 years ago
Ross Wightman
4c09a2f169
Bump version 0.4.12
3 years ago
Ross Wightman
b319eb5b5d
Update ViT weights, more details to be added before merge.
3 years ago
Ross Wightman
8257b86550
Fix up resnetv2 bit/bitm model default res
3 years ago
Ross Wightman
1228f5a3d8
Add BiT distilled 50x1 and teacher 152x2 models from 'A good teacher is patient and consistent' paper.
3 years ago
Ross Wightman
511a8e8c96
Add official ResMLP weights.
3 years ago
Ross Wightman
b9cfb64412
Support npz custom load for vision transformer hybrid models. Add posembed rescale for npz load.
3 years ago
Ross Wightman
8319e0c373
Add file docstring to std_conv.py
3 years ago
Ross Wightman
4d96165989
Merge branch 'master' into cleanup_xla_model_fixes
3 years ago
Ross Wightman
8880f696b6
Refactoring, cleanup, improved test coverage.
* Add eca_nfnet_l2 weights, 84.7 @ 384x384
* All 'non-std' (ie transformer / mlp) models have classifier / default_cfg test added
* Fix #694 reset_classifier / num_features / forward_features / num_classes=0 consistency for transformer / mlp models
* Add direct loading of npz to vision transformer (pure transformer so far, hybrid to come)
* Rename vit_deit* to deit_*
* Remove some deprecated vit hybrid model defs
* Clean up classifier flatten for conv classifiers and unusual cases (mobilenetv3/ghostnet)
* Remove explicit model fns for levit conv, just pass in arg
3 years ago
Ross Wightman
ba2ca4b464
One codepath for stdconv, switch layernorm to batchnorm so gain included. Tweak epsilon values for nfnet, resnetv2, vit hybrid.
3 years ago
Ross Wightman
b7a568f065
Fix torchscript issue in bat
3 years ago
Ross Wightman
d17b374f0f
Minimum input_size needed to be higher
3 years ago
Ross Wightman
b3b90d944d
Add min_input_size to bat_resnext to prevent test breakage.
3 years ago
Ross Wightman
d413eef1bf
Add ResMLP-24 model weights that I trained in PyTorch XLA on TPU-VM. 79.2 top-1.
3 years ago
Ross Wightman
10d8fa4620
Add gc and bat attention resnext26ts variants to byob for test.
3 years ago
Ross Wightman
2f5ed2dec1
Update `init_values` const for 24 and 36 layer ResMLP models
3 years ago
Ross Wightman
8e4ac3549f
All ScaledStdConv and StdConv uses default to using F.layernorm so that they work with PyTorch XLA. eps value tweaking is a WIP.
3 years ago
Ross Wightman
2a63d0246b
Post merge cleanup
3 years ago
Ross Wightman
45dec179e5
Merge pull request #681 from lmk123568/master
Update convit.py
3 years ago
Dongyoon Han
ded1671483
Fix stochastic depth working only with a shortcut
3 years ago
Mike
b87d98b238
Update convit.py
Cut out the duplicates
3 years ago
Ross Wightman
02320c3e3d
Bump version to 0.4.11
4 years ago
Ross Wightman
bda8ab015a
Remove min channels for SelectiveKernel, divisor should cover cases well enough.
4 years ago
Ross Wightman
a27f4aec4a
Missed args for skresnext w/ refactoring.
4 years ago
Ross Wightman
307a935b79
Add non-local and BAT attention. Merge attn and self-attn factories into one. Add attention references to README. Add mlp 'mode' to ECA.
4 years ago
Ross Wightman
8bf63b6c6c
Able to use other attn layer in EfficientNet now. Create test ECA + GC B0 configs. Make ECA more configurable.
4 years ago
Ross Wightman
bcec14d3b5
Bring EfficientNet SE layer in line with others, pull se_ratio outside of blocks. Allows swapping w/ other attn layers.
4 years ago
Ross Wightman
9611458e19
Throw in some FBNetV3 code I had lying around, some refactoring of SE reduction channel calcs for all EffNet archs.
4 years ago
Ross Wightman
01b9108619
Merge branch 'master' into more_attn
4 years ago
Ross Wightman
d7bab8a6c5
Fix strict flag change for checkpoint load.
4 years ago
Ross Wightman
02f9d4bc34
Add weights for resnet51q model, add 61q def.
4 years ago
Ross Wightman
f615474be3
Fix broken test, repvgg block doesn't have attn_last attr.
4 years ago
Ross Wightman
742c2d5247
Add Gather-Excite and Global Context attn modules. Refactor existing SE-like attn for consistency and refactor byob/byoanet for less redundancy.
4 years ago
Ross Wightman
9c78de8c02
Fix #661 , move hardswish out of default args for LeViT. Enable native torch support for hardswish, hardsigmoid, mish if present.
4 years ago
Ross Wightman
5db7452173
Fix visformer in_chans stem handling
4 years ago
Ross Wightman
318360c3f9
Update README.md before merge. Bump version to 0.4.10
4 years ago
Ross Wightman
11ae795e99
Redo LeViT attention bias caching in a way that works with both torchscript and DataParallel
4 years ago
Ross Wightman
d400f1dbdd
Filter test models before creation for backward/torchscript tests
4 years ago
Ross Wightman
c4572cc5aa
Add Visformer-small weights, tweak torchscript jit test img size.
4 years ago
Ross Wightman
bfc72f75d3
Expand scope of testing for non-std vision transformer / mlp models. Some related cleanup and create fn cleanup for all vision transformer and mlp models. More CoaT weights.
4 years ago
Ross Wightman
18bf520ad1
Add eca_nfnet_l2/l3 defs for future training
4 years ago
Ross Wightman
f45de37690
Merge branch 'master' into levit_visformer_rednet
4 years ago
Ross Wightman
23c18a33e4
Add efficientnetv2_rw_m weights trained in PyTorch. 84.8 top-1 @ 416 test. 53M params.
4 years ago
Ross Wightman
c2ba229d99
Prep for efficientnetv2_rw_m model weights that started training before official release.
4 years ago
Ross Wightman
30b9880d06
Minor adjustment, mutable default arg, extra check of valid len...
4 years ago
Ross Wightman
be0abfbcce
Merge branch 'master' of https://github.com/alexander-soare/pytorch-image-models into alexander-soare-master
4 years ago
Ross Wightman
b7de82e835
ConViT cleanup, fix torchscript, bit of reformatting, reuse existing layers.
4 years ago
Ross Wightman
306c86b668
Merge branch 'convit' of https://github.com/amaarora/pytorch-image-models into amaarora-convit
4 years ago
Ross Wightman
a569635045
Update twin weights to a copy in GitHub releases for faster dl. Tweak model class comment.
4 years ago
Ross Wightman
be99eef9c1
Remove redundant code, cleanup, fix torchscript.
4 years ago
Ross Wightman
5ab372a3ec
Merge branch 'master' of https://github.com/abcdvzz/pytorch-image-models into abcdvzz-master
4 years ago
Aman Arora
5db1eb6ba5
Add defaults
4 years ago
Aman Arora
8b1f2e8e1f
remove unused matplotlib import
4 years ago
Aman Arora
40c506ba1e
Add ConViT
4 years ago
Alexander Soare
7976019864
extend positional embedding resizing functionality to tnt
4 years ago
Alexander Soare
8086943b6f
allow resize positional embeddings to non-square grid
4 years ago
talrid
dc1a4efd28
mixer_b16_224_miil, mixer_b16_224_miil_in21k models
4 years ago
李鑫杰
7b799c4e79
add latest code
4 years ago
Ross Wightman
d5af752117
Add preliminary gMLP and ResMLP impl to Mlp-Mixer
4 years ago
李鑫杰
00548b8427
Add Twins
4 years ago
Ross Wightman
e7f0db8664
Fix drop/drop_path arg on MLP-Mixer model. Fix #641
4 years ago
Ross Wightman
9a3ae97311
Another set of byoanet models w/ ECA channel + SA + groups
4 years ago
Ross Wightman
d53e91218e
Fix tf.data options setting for newer TF versions
4 years ago
Ross Wightman
7077f16c6a
Change 21k model naming from _21k to _in21k for consistency with existing 21k models.
4 years ago
Ross Wightman
94d4b53352
Add temporary default_cfgs to visformer models so they pass tests
4 years ago
Ross Wightman
3bffc701f1
Merge branch 'master' into levit_visformer_rednet
4 years ago
Ross Wightman
ecc7552c5c
Add levit, levit_c, and visformer model defs. Largely untested and not finished cleanup.
4 years ago
Ross Wightman
165fb354b2
Add initial RedNet model / Involution layer impl for testing
4 years ago
Ross Wightman
328249f11a
Update README, tweak fine-tune effv2 model names.
4 years ago
Ross Wightman
c4f482a08b
EfficientNetV2 official impl w/ weights ported from TF. Cleanup/refactor of related EfficientNet classes and models.
4 years ago
Ross Wightman
4fbc32d3d0
Fix crop_pct for cait models.
4 years ago
Ross Wightman
715519a5ef
Rethink name of patch embed grid info
4 years ago
Ross Wightman
b2c305c2aa
Move Mlp and PatchEmbed modules into layers. Being used in lots of models now...
4 years ago
Ross Wightman
3ba6b55cb2
More adjustments to ByoaNet models for further experiments.
4 years ago
Ross Wightman
5fcddb96a8
Merge branch 'master' into cait
4 years ago
Ross Wightman
3db12b4b6a
Finish CaiT cleanup
4 years ago
Ross Wightman
2d8b09fe8b
Add official pretrained weights to MLP-Mixer, complete model cfgs.
4 years ago
Ross Wightman
12efffa6b1
Initial MLP-Mixer attempt...
4 years ago
Ross Wightman
0721559511
Improved (hopefully) init for SA/SA-like layers used in ByoaNets
4 years ago
Ross Wightman
d5473c17f7
Fix incorrect name of shortcut/identity paths in many residual nets. Inherited from naming in old old torchvision, long fixed there.
4 years ago
Ross Wightman
0d87650fea
Remove filter hack from BlurPool w/ non-persistent buffer. Use BlurPool2d instead of AntiAliasing.. for TResNet. Breaks PyTorch < 1.6.
4 years ago
Ross Wightman
ddc743fdf8
Update ResNet-RS models to EMA weights
4 years ago
Ross Wightman
08d60f4a9a
resnetrs50 pool sizing wrong
4 years ago
Ross Wightman
1daa15ecc3
Initial Cait commit. Still some cleanup to do.
4 years ago
Ross Wightman
67d0665b46
Post ResNet-RS merge cleanup. Add weight urls, adjust train/test/crop pct.
4 years ago
Aman Arora
560eae38f5
[WIP] Add ResNet-RS models ( #554 )
* Add ResNet-RS models
* Only include resnet-rs changes
* remove whitespace diff
* EOF newline
* Update time
* increase time
* Add first conv
* Try running only resnetv2_101x1_bitm on Linux runner
* Add to exclude filter
* Run test_model_forward_features for all
* Add to exclude ftrs
* back to defaults
* only run test_forward_features
* run all tests
* Run all tests
* Add bigger resnetrs to model filters to fix Github CLI
* Remove resnetv2_101x1_bitm from exclude feat features
* Remove hardcoded values
* Make sure reduction ratio in resnetrs is 0.25
* There is no bias in replaced maxpool so remove it
4 years ago
Ross Wightman
9cc7dda6e5
Fixup byoanet configs to pass unit tests. Add swin_attn and swinnet26t model for testing.
4 years ago
Ross Wightman
e15c3886ba
Default lambda r=7. Define '26t' stage 4/5 256x256 variants for all of bot/halo/lambda nets for experiment. Add resnet50t for exp. Fix a few comments.
4 years ago
Ross Wightman
e5e15754c9
Fix coat first conv ident
4 years ago
Ross Wightman
76739a7589
CoaT merge. Bit of formatting, fix torchscript (for non features), remove einops/einsum dep, add pretrained weight hub (url) support.
4 years ago
Ross Wightman
026430c083
Merge branch 'master' of https://github.com/morizin/pytorch-image-models-1 into morizin-master
4 years ago
Ross Wightman
a0492e3b48
A few miil weights naming tweaks to improve compat with model registry and filtering wildcards.
4 years ago
talrid
8c1f03e56c
comment
4 years ago
talrid
19e1b67a84
old spaces
4 years ago
talrid
a443865876
update naming and scores
4 years ago
talrid
cf0e371594
84_0
4 years ago
talrid
0968bdeca3
vit, tresnet and mobilenetV3 ImageNet-21K-P weights
4 years ago
morizin
1e3b6d4dfc
Update __init__.py
4 years ago
morizin
fd022fd6a2
Update __init__.py
4 years ago
morizin
c2d5087eae
Add files via upload
4 years ago
Norman Mu
79640fcc1f
Enable uniform augmentation magnitude sampling and set AugMix default
4 years ago
Ross Wightman
c1cf9712fc
Add updated EfficientNet-V2S weights, 83.8 @ 384x384 test. Add PyTorch trained EfficientNet-B4 weights, 83.4 @ 384x384 test. Tweak non TF EfficientNet B1-B4 train/test res scaling.
4 years ago
Ross Wightman
e8a64fb881
Test input size for efficientnet_v2s was wrong in last results run
4 years ago
Ross Wightman
2df77ee5cb
Fix torchscript compat and features_only behaviour in GhostNet PR. A few minor formatting changes. Reuse existing layers.
4 years ago
Ross Wightman
d793deb51a
Merge branch 'master' of https://github.com/iamhankai/pytorch-image-models into iamhankai-master
4 years ago
Ross Wightman
e685618f45
Merge pull request #550 from amaarora/wandb
Wandb Support
4 years ago
Ross Wightman
f606c45c38
Add Swin Transformer models from https://github.com/microsoft/Swin-Transformer
4 years ago
iamhankai
de445e7827
Add GhostNet
4 years ago
Ross Wightman
5a196dddf6
Update README.md with latest, bump version to 0.4.8
4 years ago
Ross Wightman
b3d7580df1
Update ByoaNet comments. Fix first Stem feat chs for ByobNet.
4 years ago
Ross Wightman
16f7aa9f54
Add default_cfg options for min_input_size / fixed_input_size, queries in model registry, and use for testing self-attn models
4 years ago
Ross Wightman
4e4b863b15
Missed norm.py
4 years ago
Ross Wightman
7c97e66f7c
Remove commented code, add more consistent seed fn
4 years ago
Ross Wightman
364dd6a58e
Merge branch 'master' into byoanet-self_attn
4 years ago
Ross Wightman
ce62f96d4d
ByoaNet with bottleneck transformer, lambda resnet, and halo net experiments
4 years ago
Ross Wightman
cd3dc4979f
Fix adabelief imports, remove prints, preserve memory format is the default arg for zeros_like
4 years ago
Ross Wightman
21812d33aa
Add prelim efficientnet_v2s weights from 224x224 train, eval 83.3 @ 288. Add eca_nfnet_l1 weights, train at 256, eval 84 @ 320.
4 years ago
Aman Arora
5772c55c57
Make wandb optional
4 years ago
Aman Arora
f54897cc0b
make wandb not required but rather optional as huggingface_hub
4 years ago
Aman Arora
3f028ebc0f
import wandb in summary.py
4 years ago
Aman Arora
624c9b6949
log to wandb only if using wandb
4 years ago
juntang
addfc7c1ac
adabelief
4 years ago
Ross Wightman
fb896c0b26
Update some comments re preliminary EfficientNet-V2 assumptions
4 years ago
Ross Wightman
2b49ab7a36
Fix ResNetV2 pretrained classifier issue. Fixes #540
4 years ago
Ross Wightman
de9dff933a
EfficientNet-V2S preliminary model def (for experimentation)
4 years ago
Ross Wightman
37c71a5609
Some further create_optimizer_v2 tweaks, remove some redundant code, add back safe model str. Benchmark step times per batch.
4 years ago
Ross Wightman
2bb65bd875
Wrong default_cfg pool_size for L1
4 years ago
Ross Wightman
bf2ca6bdf4
Merge jax and original weight init
4 years ago
Ross Wightman
acbd698c83
Update README.md with updates. Small tweak to head_dist handling.
4 years ago
Ross Wightman
9071568f0e
Add weights for SE NFNet-L0 model, rename nfnet_l0b -> nfnet_l0. 82.75 top-1 @ 288. Add nfnet_l1 model def for training.
4 years ago
Ross Wightman
c468c47a9c
Add regnety_160 weights from DeiT teacher model, update that and my regnety_032 weights to use higher test size.
4 years ago
Ross Wightman
288682796f
Update benchmark script to add precision arg. Fix some downstream (DeiT) compat issues with latest changes. Bump version to 0.4.7
4 years ago
Ross Wightman
ea9c9550b2
Fully move ViT hybrids to their own file, including embedding module. Remove some extra DeiT models that were for benchmarking only.
4 years ago
Ross Wightman
a5310a3451
Merge remote-tracking branch 'origin/benchmark-fixes-vit_hybrids' into pit_and_vit_update
4 years ago
Ross Wightman
7953e5d11a
Fix pos_embed scaling for ViT and num_classes != 1000 for pretrained distilled deit and pit models. Fix #426 and fix #433
4 years ago
Ross Wightman
a760a4c3f4
Some ViT cleanup, merge distilled model with main, fixup torchscript support for distilled models
4 years ago
Ross Wightman
0dfc5a66bb
Add PiT model from https://github.com/naver-ai/pit
4 years ago
Ross Wightman
51febd869b
Small tweak to tests for tnt model, reorder model imports.
4 years ago
Ross Wightman
b27a4e0d88
Merge branch 'master' of https://github.com/contrastive/pytorch-image-models into contrastive-master
4 years ago
Aman Arora
6b18061773
Add GIST to docstring for quick access
4 years ago
contrastive
de86314655
Update TNT
4 years ago
Aman Arora
92b1db9a79
update docstrings and add check on and
4 years ago
Aman Arora
b85be24054
update to work with fnmatch
4 years ago
contrastive
cfc15283a4
Update TNT url
4 years ago
contrastive
4a09bc851e
Add TNT model
4 years ago
Aman Arora
20626e8387
Add to extract stats for SPP
4 years ago
Ross Wightman
cf5fec5047
Cleanup experimental vit weight init a bit
4 years ago
Ross Wightman
f42f1df26c
Improve evenness of per-worker split for validation set with TFDS
4 years ago
Ross Wightman
cbcb76d72c
Should have included Conv2d layers in original weight init. Let's see what the impact is...
4 years ago
Ross Wightman
4de57ccf01
Add weight init scheme that's closer to JAX impl
4 years ago
Ross Wightman
14ac4abf74
Change huggingface hub revision delimiter to '@', add hf_hub reference for eca_nfnet_l0 model as an example.
4 years ago
Ross Wightman
740f32c96a
Add ECA-NFNet-L0 weights and update model name. Update README and bump version to 0.4.6
4 years ago
Ross Wightman
5e2e4e7fb6
Merge pull request #502 from khatami-mehrdad/efficientnet_dg
adding efficientnet_el, efficientnet_es_pruned and efficientnet_el_pruned pre-trained models
4 years ago
Ross Wightman
45c048ba13
A few minor fixes and bit more cleanup on the huggingface hub integration.
4 years ago
Mehrdad
97f49d163c
adding efficientnet_el, efficientnet_es_pruned and efficientnet_el_pruned pretrained models to timm/models
4 years ago
Ross Wightman
ead80d33c5
Fix typo, naming consistency
4 years ago
Ross Wightman
d584e7f617
Support for huggingface hub via create_model and default_cfgs.
* improve consistency of model creation helper fns
* add comments to some of the model helpers
* support passing external default_cfgs so they can be sourced from hub
4 years ago
Ross Wightman
1f799afcb3
Remove capital letters in model names
4 years ago
AFLALO, Jonathan Isaac
82c1fe958a
added hardcorenas model
refactoring to support feature extraction
4 years ago
Ross Wightman
f57db99101
Update README, fix iabn pip version print.
4 years ago
Ross Wightman
7ef7788ee9
Fix CUDA crash w/ channels-last + CSP models. Remove use of chunk()
4 years ago
Ross Wightman
317ea3e599
Tweak NFNet, ResNetV2, RexNet feature extraction, use pre-act features for simplicity. Update tests for pytorch 1.8. 0.4.5 version bump.
4 years ago
Ross Wightman
17cdee7354
Fix C&P patch_size error, and order of op patch_size arg resolution bug. Remove a test vit model.
4 years ago
Ross Wightman
0706d05d52
Benchmark models listed in txt file. Add more hybrid vit variants for testing
4 years ago
Ross Wightman
2db2d87ff7
Add epoch-repeats arg to multiply the number of dataset passes per epoch. Currently for iterable datasets (read TFDS wrapper) only.
4 years ago
Ross Wightman
de97be9146
Spell out diff between my small and deit small vit models.
4 years ago
Ross Wightman
f0ffdf89b3
Add numerous experimental ViT Hybrid models w/ ResNetV2 base. Update the ViT naming for hybrids. Fix #426 for pretrained vit resizing.
4 years ago
Ross Wightman
0e16d4e9fb
Add benchmark.py script, and update optimizer factory to be more friendly to use outside of argparse interface.
4 years ago
Ross Wightman
4bc103f504
Fix CUDA crash w/ channels-last + CSP models. Remove use of chunk()
4 years ago
Ross Wightman
8563609b28
Update notes in ScaledStdConv impl
4 years ago
Ross Wightman
678ba4e0a2
Add NFNet-F model weights ported from DeepMind Haiku impl and new set of models w/ compatible config.
4 years ago
Ross Wightman
9de2ec5e44
Update README for AGC and bump version to 0.4.4
4 years ago
Ross Wightman
4f49b94311
Initial AGC impl. Still testing.
4 years ago
Ross Wightman
5f9aff395c
Fix stem width in NFNet-F models, add some more comments, add some 'light' NFNet models for testing.
4 years ago
Ross Wightman
d86dbe45c2
Update README.md and few more comments
4 years ago
Ross Wightman
0d253e2c5e
Fix issue with nfnet tests, bit more cleanup.
4 years ago
Ross Wightman
cb06c7a910
Add NFNet-F models and tweak existing NF models.
4 years ago
Ross Wightman
e4de077021
Add first 'Normalizer Free' models. nf_regnet_b1 79.3 @ 288x288 test, and nf_resnet50 80.3 @ 256x256 test (80.68 @ 288x288).
4 years ago
Ross Wightman
d8e69206be
Merge pull request #419 from rwightman/byob_vgg_models
More models, GPU-Efficient Nets, RepVGG, classic VGG, and flexible Byob backbone.
4 years ago
Ross Wightman
ca9b078ac7
Update README.md and docs. Version bumped to 0.4.3
4 years ago
Ross Wightman
6853b07bbd
Improve RepVGG block identity/vs non for clarity and fix attn usage. Add comments.
4 years ago
Ross Wightman
0356e773f5
Default to native PyTorch AMP instead of APEX amp. Too many APEX issues cropping up lately.
4 years ago
Reuben
94ca140b67
update collections.abc import
4 years ago
Ross Wightman
b4e216e377
Fix a few small things.
4 years ago
Ross Wightman
dc85e5a237
Add ByobNet w/ GPU-EfficientNets and RepVGG. Also add classic vgg models.
4 years ago
Ross Wightman
1bcc69e0ad
Use in_channels for depthwise groups, allows using `out_channels=N * in_channels` (does not impact existing models). Fix #354 .
4 years ago
Ross Wightman
9811e229f7
Fix regression in models with 1001 class pretrained weights. Improve batchnorm arg and BatchNormAct layer handling in several models.
4 years ago
Ross Wightman
a39c3ee216
Merge branch 'master' into eca-weights
4 years ago
Ross Wightman
e9d6fe293c
Update README for new weights. Version 0.4.2
4 years ago
Ross Wightman
666de85cf1
Move stride in EdgeResidual block to 3x3 expansion conv. Fix #414
4 years ago
Ross Wightman
3b57490a63
Fix some half removed resnet model defs, pooling for ecaresnet269d
4 years ago
Ross Wightman
68a4144882
Add new weights for ecaresnet26t/50t/269d models. Remove distinction between 't' and 'tn' (tiered models), tn is now t. Add test time img size spec to default cfg.
4 years ago
Ross Wightman
b9843f954b
Merge pull request #282 from tigert1998/patch-1
Add symbolic for SwishJitAutoFn to support onnx
4 years ago
hwangdeyu
7a4be5c035
add operator HardSwishJitAutoFn export to onnx
4 years ago
Ross Wightman
4203efa36d
Fix #387 so that checkpoint saver works with max history of 1. Add checkpoint-hist arg to train.py.
4 years ago
Ross Wightman
f0e65e37b7
Fix NF-ResNet101 model defs
4 years ago
Ross Wightman
2c988c3b6e
Update README.md for NF-nets, bump version to 0.4.1 for merge
4 years ago
Ross Wightman
2de54d174a
Fix pool size defs for NFNet models, add a comment.
4 years ago
Ross Wightman
90980de4a9
Fix up a few details in NFResNet models, managed stable training. Add support for gamma gain to be applied in activation or ScaleStdConv. Some tweaks to ScaledStdConv.
4 years ago
Ross Wightman
5a8e1e643e
Initial Normalizer-Free Reg/ResNet impl. A bit of related layer refactoring.
4 years ago
Ross Wightman
38d8f67570
Fix potential issue with change to num_classes arg in train/validate.py defaulting to None (rely on model def / default_cfg)
4 years ago
Ross Wightman
587780e56b
Update README.md and bump version to 0.4.0
4 years ago
Ross Wightman
bb50ac4708
Add DeiT distilled weights and distilled model def. Remove some redundant ViT model args.
4 years ago
Ross Wightman
c16e965037
Add some ViT comments and fix a few minor issues.
4 years ago
Ross Wightman
22748f1a2d
Convert samples/targets in ParserImageInTar to numpy arrays, slightly less mem usage for massive datasets. Add a few more se/eca model defs to resnet.py
4 years ago
Ross Wightman
5d4c3d0af3
Add enhanced ParserImageInTar that can read images from tars within tars, folders with multiple tars, etc. Additional comment cleanup.
4 years ago
Ross Wightman
55f7dfa9ea
Refactor vision_transformer entry fns, add pos embedding resize support for fine tuning, add some deit models for testing
4 years ago
Ross Wightman
d55bcc0fee
Finishing adding stochastic depth support to BiT ResNetV2 models
4 years ago
Ross Wightman
855d6cc217
More dataset work including factories and a tensorflow datasets (TFDS) wrapper
* Add parser/dataset factory methods for more flexible dataset & parser creation
* Add dataset parser that wraps TFDS image classification datasets
* Tweak num_classes handling bug for 21k models
* Add initial deit models so they can be benchmarked in next csv results runs
4 years ago
Ross Wightman
20516abc18
Fix some broken tests for ResNetV2 BiT models
4 years ago
Ross Wightman
59ec7e6a53
Merge branch 'master' into imagenet21k_datasets_more
4 years ago
Ross Wightman
e7a9ddf982
Merge pull request #334 from kecsap/links
Follow symbolic links during dataset scanning
4 years ago
Csaba Kertesz
7cae7e7035
Follow links during dataset scanning
4 years ago
Ross Wightman
c96e9f99a0
Update version to 0.3.3
4 years ago
Ross Wightman
4e2533db77
Add 320x320 model default cfgs for 101D and 152D ResNets. Add SEResNet-152D weights and 320x320 cfg.
4 years ago
Ross Wightman
0167f749d3
Remove some old __future__ imports
4 years ago
Ross Wightman
e35e9760a6
More work on dataset / parser split and imagenet21k (tar) support
4 years ago
Ross Wightman
ce69de70d3
Add 21k weight urls to vision_transformer. Cleanup feature_info for preact ResNetV2 (BiT) models
4 years ago
Ross Wightman
231d04e91a
ResNetV2 pre-act and non-preact model, w/ BiT pretrained weights and support for ViT R50 model. Tweaks for in21k num_classes passing. More to do... tests failing.
4 years ago
Ross Wightman
de6046e213
Initial commit for dataset / parser reorg to support additional datasets / types
4 years ago
Ross Wightman
392595c7eb
Add pool_size to default cfgs for new models to prevent tests from failing. Add explicit 200D_320 model entrypoint for next benchmark run.
4 years ago
Ross Wightman
b1f1228a41
Add ResNet101D, 152D, and 200D weights, remove meh 66d model
4 years ago
Jasha
7c56c718f3
Configure create_optimizer with args.opt_args
Closes #301
4 years ago
Ross Wightman
9a25fdf3ad
Merge pull request #297 from rwightman/ema_simplify
Simplified JIT compatible Ema module. Fixes for SiLU export and torchscript training w/ Linear layer.
4 years ago
Tymoteusz Wiśniewski
de15b43865
Fix a bug with accuracy retrieving from RealLabels
4 years ago
Ross Wightman
cd72e66eff
Bug in last mod for features_only default_cfg
4 years ago
Ross Wightman
867a0e5a04
Add default_cfg back to models wrapped in feature extraction module as per discussion in #294 .
4 years ago
Ross Wightman
4ca52d73d8
Add separate set and update method to ModelEmaV2
4 years ago
Ross Wightman
2ed8f24715
A few more changes for 0.3.2 maint release. Linear layer change for mobilenetv3 and inception_v3, support no bias for linear wrapper.
4 years ago
Ross Wightman
6504a42832
Version 0.3.2
4 years ago
Ross Wightman
460eba7f24
Work around casting issue with combination of native torch AMP and torchscript for Linear layers
4 years ago
Ross Wightman
5f4b6076d8
Fix inplace arg compat for GELU and PReLU via activation factory
4 years ago
Ross Wightman
fd962c4b4a
Native SiLU (Swish) op doesn't export to ONNX
4 years ago
Ross Wightman
27bbc70d71
Add back old ModelEma and rename new one to ModelEmaV2 to avoid compat breaks in dependent code. Shuffle train script, add a few comments, remove DataParallel support, support experimental torchscript training.
4 years ago
tigertang
43f2500c26
Add symbolic for SwishJitAutoFn to support onnx
4 years ago
Ross Wightman
9214ca0716
Simplifying EMA...
4 years ago
Ross Wightman
53aeed3499
ver 0.3.1
4 years ago
Ross Wightman
30ab4a1494
Fix issue in optim factory with sgd / eps flag. Bump version to 0.3.1
4 years ago
Ross Wightman
741572dc9d
Bump version to 0.3.0 for pending PyPi push
4 years ago
Ross Wightman
b401952caf
Add newly added vision transformer large/base 224x224 weights ported from JAX official repo
4 years ago
Ross Wightman
61200db0ab
in_chans=1 working w/ pretrained weights for vision_transformer
4 years ago
Ross Wightman
e90edce438
Support native silu activation (aka swish). An optimized ver is available in PyTorch 1.7.
4 years ago
Ross Wightman
da6cd2cc1f
Fix regression for pretrained classifier loading when using entrypt functions directly
4 years ago
Ross Wightman
f591e90b0d
Make sure num_features attr is present in vit models as with others
4 years ago
Ross Wightman
4a3df7842a
Fix topn metric view regression on PyTorch 1.7
4 years ago
Ross Wightman
f944242cb0
Fix #262 , num_classes arg mixup. Make vision_transformers a bit closer to other models wrt get/reset classifier/forward_features. Fix torchscript for ViT.
4 years ago
Ross Wightman
736f209e7d
Update vision transformers to be compatible with official code. Port official ViT weights from jax impl.
4 years ago
Ross Wightman
477a78ed81
Fix optimizer factory regression for optimizers like sgd/momentum that don't have an eps arg
4 years ago
Ross Wightman
27a93e9de7
Improve test crop for ViT models. Small now 77.85, added base weights at 79.35 top-1.
4 years ago
Ross Wightman
d4db9e7977
Add small vision transformer weights. 77.42 top-1.
4 years ago
talrid
27fadaa922
asymmetric_loss
4 years ago
Ross Wightman
f31933cb37
Initial Vision Transformer impl w/ patch and hybrid variants. Refactor tuple helpers.
4 years ago
Ross Wightman
a4d8fea61e
Add model based wd skip support. Improve cross version compat of optimizer factory. Fix #247
4 years ago
Ross Wightman
80078c47bb
Add Adafactor and Adahessian optimizers, cleanup optimizer arg passing, add gradient clipping support.
4 years ago
Ross Wightman
fcb6258877
Add missing leaky_relu layer factory defn, update Apex/Native loss scaler interfaces to support unscaled grad clipping. Bump ver to 0.2.2 for pending release.
4 years ago
Ross Wightman
e8e2d9cabf
Add DropPath (stochastic depth) to ReXNet and VoVNet. RegNet DropPath impl tweak and dedupe se args.
4 years ago
Ross Wightman
e8ca45854c
More models in sotabench, more control over sotabench run, dataset filename extraction consistency
4 years ago
Ross Wightman
9c406532bd
Add EfficientNet-EdgeTPU-M (efficientnet_em) model trained natively in PyTorch. More sotabench fiddling.
4 years ago
Ross Wightman
c40384f5bd
Add ResNet weights. 80.5 (top-1) ResNet-50-D, 77.1 ResNet-34-D, 72.7 ResNet-18-D.
4 years ago
Ross Wightman
47a7b3b5b1
More flexible mixup mode, add 'half' mode.
4 years ago
Ross Wightman
532e3b417d
Reorg of utils into separate modules
4 years ago
Ross Wightman
33f8a1bf36
Updated README, add wide_resnet50_2 and seresnext50_32x4d weights
4 years ago
Ross Wightman
751b0bba98
Add global_pool (--gp) arg changes to allow passing 'fast' easily for train/validate to avoid channels_last issue with AdaptiveAvgPool
4 years ago
Ross Wightman
9c297ec67d
Cleanup Apex vs native AMP scaler state save/load. Cleanup CheckpointSaver a bit.
4 years ago
Ross Wightman
80c9d9cc72
Add 'fast' global pool option, remove redundant SEModule from tresnet, normal one is now 'fast'
4 years ago
Ross Wightman
90a01f47d1
hrnet features_only pretrained weight loading issue. Fix #232 .
4 years ago
Ross Wightman
110a7c4982
AdaptiveAvgPool2d -> mean((2,3)) for all SE/attn layers to avoid NaN with AMP + channels_last layout. See https://github.com/pytorch/pytorch/issues/43992
4 years ago
Ross Wightman
c2cd1a332e
Improve torch amp support and add channels_last support for train/validate scripts
4 years ago
Ross Wightman
470220b1f4
Fix MobileNetV3 crash with global_pool='', output consistent with other models but not equivalent due to efficient head.
4 years ago
Ross Wightman
fc8b8afb6f
Fix a silly bug in Sample version of EvoNorm missing x* part of swish, update EvoNormBatch to accumulated unbiased variance.
4 years ago
Ross Wightman
f614df7921
Bump version to 0.2.1 and update README
4 years ago
Ross Wightman
b423bc8362
Merge pull request #218 from rwightman/cutmix
...
CutMix + MixUp overhaul
4 years ago
Ross Wightman
8c9814e3f5
Final cleanup of mixup/cutmix. Element/batch modes working with both collate (prefetcher active) and without prefetcher.
4 years ago
Ross Wightman
0f5d9d8166
Add CSPResNet50 weights, 79.6 top-1 at 256x256
4 years ago
Ross Wightman
b1b6e7c361
Fix a few more issues related to #216 w/ TResNet (space2depth) and FP16 weights in wide resnets. Also don't completely dump pretrained weights in in_chans != 1 or 3 cases.
4 years ago
Ross Wightman
512b2dd645
Add new EfficientNet-B3 and RegNetY-3.2GF weights, both just over 82 top-1
4 years ago
Ross Wightman
6890300877
Add DropPath (stochastic depth) to RegNet
4 years ago
Ross Wightman
cd23f55397
Fix mixed prec issues with new mixup code
4 years ago
Yusuke Uchida
f6b56602f9
fix test_model_default_cfgs
4 years ago
Ross Wightman
f471c17c9d
More cutmix/mixup overhaul, ready to kick-off some trials.
4 years ago
Ross Wightman
d5145fa4d5
Change default_cfg names for senet to include the legacy and match model names
4 years ago
Ross Wightman
92f2d0d65d
Merge branch 'master' into cutmix. Fixup a few issues.
4 years ago
Ross Wightman
1696499ce5
Bump version to 0.2.0, ready to roll (I think)
4 years ago
Ross Wightman
e62758cf4f
More documentation updates, fix a typo
4 years ago
Ross Wightman
dfe80414a6
Add bool arg helper
4 years ago
Ross Wightman
fa28067704
Add more augmentation arguments, including a no_aug disable flag. Fix #209
4 years ago
Ross Wightman
b1f1a54de9
More uniform treatment of classifiers across all models, reduce code duplication.
4 years ago
Ross Wightman
d72ddafe56
Fix some checkpoint / model str regressions
4 years ago
Ross Wightman
ac18adb9c3
Remove debug print from RexNet
4 years ago
Ross Wightman
c53ec33ae0
Add synset/label indices for results generation. Add 'valid labels' to validation script to support imagenet-a/r label subsets properly.
4 years ago
Ross Wightman
ec4976fdba
Add EfficientNet-Lite0 weights trained with this code by @hal-314, 75.484 top-1
4 years ago
Ross Wightman
9ecd16bd7b
Add new seresnet50 (non-legacy) model weights, 80.274 top-1
4 years ago
Ross Wightman
7995295968
Merge branch 'logger' into features. Change 'logger' to '_logger'.
4 years ago
Ross Wightman
1998bd3180
Merge branch 'feature/AB/logger' of https://github.com/antoinebrl/pytorch-image-models into logger
4 years ago
Ross Wightman
6c17d57a2c
Fix some attributions, add copyrights to some file docstrings
4 years ago
Ross Wightman
a69c0e04f0
Fix pool size in cspnet
4 years ago
Ross Wightman
14ef7a0dd6
Rename csp.py -> cspnet.py
4 years ago
Ross Wightman
ec37008432
Add pretrained weight links to CSPNet for cspdarknet53, cspresnext50
4 years ago
Sangdoo Yun
e93e571f7a
Add `adamp` and `sgdp` optimizers.
...
Update requirements.txt
Update optim_factory.py
Add `adamp` optimizer
Update __init__.py
copy files of adamp & sgdp
Create adamp.py
Update __init__.py
Create sgdp.py
Update optim_factory.py
Update optim_factory.py
Update requirements.txt
Update adamp.py
Update sgdp.py
Update sgdp.py
Update adamp.py
4 years ago
Ross Wightman
08016e839d
Cleanup FeatureInfo getters, add TF models sourced Xception41/65/71 weights
4 years ago
Ross Wightman
7ba5a384d3
Add ReXNet w/ remapped weights, feature support
4 years ago
Ross Wightman
c9d54bc1c3
Add HRNet feature extraction, fix senet type, lower feature testing res to 96x96
4 years ago
Ross Wightman
2ac663f340
Add feature support to legacy senets, add 32x32 resnext models to exclude list for feature testing.
4 years ago
Ross Wightman
c146b54abc
Cleanup EfficientNet/MobileNetV3 feature extraction a bit, only two tap locations now, small mobilenetv3 models work
4 years ago
Ross Wightman
68fd8a267b
Merge branch 'master' into features
4 years ago
Ross Wightman
4e61c6a12d
Cleanup, refactoring of Feature extraction code, add tests, fix tests, non hook feature extraction working with torchscript
4 years ago
Ross Wightman
6eec3fb4a4
Move FeatureHooks into features.py, switch EfficientNet, MobileNetV3 to use build model helper
4 years ago
Ross Wightman
9eba134d79
More models supporting feature extraction, xception, gluon_xception, inception_v3, inception_v4, pnasnet, nasnet, dla. Fix DLA unused projection params.
4 years ago
Ross Wightman
298fba09ac
Back out some activation hacks trialing upcoming pytorch changes
4 years ago
Ross Wightman
3b9004bef9
Lots of changes to model creation helpers, close to finalizing feature extraction / interfaces
4 years ago
Ross Wightman
e2cc481310
Update CSP ResNets for cross expansion without activation. Fix VovNet IABN compatibility with fixed activation arg.
4 years ago
Ross Wightman
3b6cce4c95
Add initial impl of CrossStagePartial networks, yet to be trained, not quite the same as darknet cfgs.
4 years ago
Ross Wightman
3aebc2f06c
Switch DPN to use BnAct layer, train a new DPN 68b model with RA to 79.21
4 years ago
Ross Wightman
f122f0274b
Significant ResNet refactor:
...
* stage creation + make_layer moved to separate fn with more sensible dilation/output_stride calc
* drop path rate decay easy to impl with refactored block creation loops
* fix dilation + blur pool combo
4 years ago
Ross Wightman
a66df5fb91
More model feature extraction support, start to deprecate senet.py, dilations added to regnet, add proper aligned xception
4 years ago
Ross Wightman
7729f40dca
Fix another bug, update all gluon resnet models to use new creation method (feature support)
4 years ago
Ross Wightman
d72ac0db25
Fix #173 , lr cycle default 0 vs 1. Fix #177 , mirror resnest weights for future stability.
4 years ago
Ross Wightman
d0113f9cdb
Fix a few issues that came up in tests
4 years ago
Ross Wightman
d23a2697d0
Working on feature extraction, interfaces refined, a number of models working, some in progress.
4 years ago
Ross Wightman
24e7535278
Remove tests from distrib
4 years ago
Ross Wightman
328339a086
Add ESE-VoVNet-19-DW weights
4 years ago
Ross Wightman
2d83752bc2
Fix default interpolation/crop of largest 2 ResNeSt models
4 years ago
Ross Wightman
f225ae8e59
Update README with model results and attribution. Make scheduler factory bit more robust to arg differences, add noise to plateau lr and fix min/max.
4 years ago
Ross Wightman
39f27c1add
Almost ready to merge, vovnet typo, version bump, readme addition
4 years ago
Ross Wightman
151679c2f1
Add custom grad tests, fix cut & paste error with hard_mish ME, add a few more pytorch act fns to factory
4 years ago
Ross Wightman
6c7932fe75
Update sotabench.py, tweak VovNet cfg
4 years ago
Ross Wightman
0aca08384f
Update regnet cfg keys to match model names so registry works properly
4 years ago
Antoine Broyelle
78fa0772cc
Leverage python hierarchical logger
...
with this update one can tune the kind of logs generated by timm but
training and inference traces are unchanged
4 years ago
Ross Wightman
13c0a6290e
Add ESE-VovNet39 weights, 79.32 top-1
5 years ago
Ross Wightman
4ddde1d3a4
Fix two regressions
5 years ago
Ross Wightman
a7e8cadd15
Remove pointless densenet configs, add an iabn version of 264 as it makes more sense to try someday...
5 years ago
Ross Wightman
e78daf586a
better densenet121 and densenetblur121d weights
5 years ago
Ross Wightman
7be299504f
Add missing feature_info() on MobileNetV3, make hook feature output order/type consistent with bottleneck (list, decreasing fmap size)
5 years ago
Ross Wightman
88129b2569
Add set_layer_config contextmgr to adjust all layer configs at once, use in create_module with new args. Remove a few old warning causing constant annotations for jit.
5 years ago
Ross Wightman
f28170df3f
Fix an untested change, remove a debug print
5 years ago
Ross Wightman
eb7653614f
Monster commit, activation refactor, VoVNet, norm_act improvements, more
...
* refactor activations into basic PyTorch, jit scripted, and memory efficient custom auto
* implement hard-mish, better grad for hard-swish
* add initial VovNet V1/V2 impl, fix #151
* VovNet and DenseNet first models to use NormAct layers (support BatchNormAct2d, EvoNorm, InplaceIABN)
* Wrap IABN for any models that use it
* make more models torchscript compatible (DPN, PNasNet, Res2Net, SelecSLS) and add tests
5 years ago
Ross Wightman
0ea53cecc3
Merge branch 'master' into densenet_update_and_more
5 years ago
Ross Wightman
6441e9cc1b
Fix memory_efficient mode for DenseNets. Add AntiAliasing (Blur) support for DenseNets and create one test model. Add lr cycle/mul params to train args.
5 years ago
Ross Wightman
afb6bd0669
Add backward and default_cfg tests and fix a few issues found. Fix #153
5 years ago
Ross Wightman
50658b9a67
Add RegNet models and weights
5 years ago
Ross Wightman
7df83258c9
Merge branch 'master' into densenet_update_and_more
5 years ago
Ross Wightman
1904ed8fec
Improve dropblock impl, add fast variant, and better AMP speed, inplace, batchwise... few ResNeSt cleanups
5 years ago
Ross Wightman
17270c69b9
Remove annoying InceptionV3 dependency on scipy and insanely slow trunc_norm init. Bring InceptionV3 code into this codebase and use upcoming torch trunc_normal_ init.
5 years ago
Ross Wightman
c4ca016656
Merge pull request #145 from rwightman/resnest
...
ResNeSt
5 years ago
Ross Wightman
208e7912f7
Missed one of the ablation model entrypoints, update README
5 years ago
Ross Wightman
9cc289f18c
Exclude EfficientNet-L2 models from test
5 years ago
Ross Wightman
2f884a0ce5
Add resnest14, resnest26, and two of the ablation grouped resnest50 models
5 years ago
Ross Wightman
f4cdc2ac31
Add ResNeSt models
5 years ago
Ross Wightman
780860d140
Add norm_act factory method, move JIT of norm layers to factory
5 years ago
Ross Wightman
14edacdf9a
DenseNet converted to support ABN (norm + act) modules. Experimenting with EvoNorm, IABN
5 years ago
Ross Wightman
022ed001f3
Update DenseNet to latest in Torchvision (torchscript compat, checkpointing, proper init). Start adding enhanced configurability, stem options...
5 years ago
Vyacheslav Shults
a7ebe09029
Replace all None by nn.Identity() in all models reset_classifier when False-values num_classes is given.
...
Make small code refactoring
5 years ago
Vyacheslav Shults
f0eb021620
Replace all None by nn.Identity() in HRNet modules
5 years ago
AFLALO, Jonathan Isaac
a7f570c9b7
added MultiEpochsDataLoader
5 years ago
Ross Wightman
8d8677e03b
Fix #139 . Broken SKResNets after BlurPool addition, as a plus, SKResNets support AA now too.
5 years ago
Ross Wightman
c9b6f415ca
Bump version for Pypi release
5 years ago
AFLALO, Jonathan Isaac
9c15d57505
added efficientnet pruned weights
5 years ago
Ross Wightman
ea300709f0
Fix model create fn not passing num_classes through. Fix #135
5 years ago
Ross Wightman
2c438c4e0d
Merge branch 'master' of github.com:rwightman/pytorch-models
5 years ago
Ross Wightman
2681a8d618
Final blurpool2d cleanup and add resnetblur50 weights, match tresnet Downsample arg order to BlurPool2d for interop
5 years ago
Ross Wightman
9590f301a9
Merge branch 'blur' of https://github.com/VRandme/pytorch-image-models into VRandme-blur
5 years ago
Ross Wightman
a5220ad3aa
Merge pull request #125 from Separius/patch-1
...
fix typo in eca
5 years ago
Ross Wightman
0834fbc01c
Move pruned model adapt strings to separate txt files. A few minor formatting alignment tweaks.
5 years ago
Ross Wightman
ba793f5c1c
Merge branch 'adding_ECA_resnet' of https://github.com/yoniaflalo/pytorch-image-models into yoniaflalo-adding_ECA_resnet
5 years ago
AFLALO, Jonathan Isaac
07f19dd699
added eca resnet
5 years ago
Ross Wightman
1d4ac1b165
Bump version for pypi release. Fix #130
5 years ago
Ross Wightman
be7c784d21
Small TResNet simplification, just use SelectAdaptivePool, don't notice any perf difference
5 years ago
Ross Wightman
e3a98171b2
Remove some CRLF that made it into TResNet merge
5 years ago
Ross Wightman
0004f37d25
Some TResNet cleanup.
...
* allow use of global pool arg, test-time-pooling
* clean checkpoints to just contain state dict, add 448 res checkpoints
* support DataParallel via lazy filter creation for JIT Downsample
* some minor formatting (mostly alignment) preferences
5 years ago
Ross Wightman
64fe37d008
Remove old mean/std helper, rely fully on cmd line or default_cfg now. Fixes #126
5 years ago
Ross Wightman
e6f24e5578
Add 'momentum' optimizer (SGD w/o nesterov) for stable EfficientDet training defaults
5 years ago
Sepehr Sameni
20290b5b56
fix typo in eca
5 years ago
Ross Wightman
ebf82b84ac
Merge pull request #122 from mrT23/master
...
TResNet models
5 years ago
Alexey Chernov
bdb165a8a4
Merge changes in feature extraction interface to MobileNetV3
...
Experimental feature extraction interface seems to be changed
a little bit with the most up to date version apparently found
in EfficientNet class. Here these changes are added to
MobileNetV3 class to make it support it and work again, too.
5 years ago
talrid
8a63c1add8
finalizing
5 years ago
talrid
6209146738
TResNet models
5 years ago
Ross Wightman
13cf68850b
Remove poorly named metrics from torch imagenet example origins. Use top1/top5 in csv output for consistency with existing validation results files, acc elsewhere. Fixes #111
5 years ago
Ross Wightman
56608c9070
Bump version for pypi
5 years ago
Ross Wightman
1a8f5900ab
Update EfficientNet feature extraction for EfficientDet. Add needed MaxPoolSame as well.
5 years ago
Chris Ha
06a50a94a8
Fix minor typos in create_attn.py and resnet.py
...
'eca'->'ceca'
and
doest not-> does not
5 years ago
Ross Wightman
1a9ab07307
One too many changes at a time, fix missing C
5 years ago
Ross Wightman
f17b42bc33
Blur filter no longer a buffer
5 years ago
Ross Wightman
6cdeca24a3
Some cleanup and fixes for initial BlurPool impl. Still some testing and tweaks to go...
5 years ago
Chris Ha
acd1b6cccd
Implement Functional Blur on resnet.py
...
1. add ResNet argument blur=''
2. implement blur for maxpool and strided convs in downsampling blocks
5 years ago
Chris Ha
ce3d82b58b
Update blurpool.py
...
clean up code for PR
5 years ago
Chris Ha
3a287a6e76
Create blurpool.py
...
Initial implementation of blur layer.
currently tests as correct against Downsample of original github
5 years ago
Ross Wightman
c99a5abed4
Merge pull request #115 from rwightman/mobilenetv2-experiment
...
MobileNet-V2 experiments
5 years ago
Ross Wightman
e34074b4da
Add final weights for MobileNet-V2 experiments
5 years ago
Ross Wightman
a6a5565de7
Fiddling...
5 years ago
Ross Wightman
5a16c533ff
Add better resnext50_32x4d weights trained by andravin
5 years ago
Ross Wightman
bc998cad91
Experimenting with some MobileNetV2 variations to compare against EfficientNet-Lite
5 years ago
Ross Wightman
3406e582cf
Add EfficientNet-Lite results, update README
5 years ago
Ross Wightman
bd05258f7b
EfficientNet-Lite model added w/ converted checkpoints, validation in progress...
5 years ago
Andrew Lavin
d92cc4da54
Modified save_checkpoint to always save last checkpoint. Fixes #98 .
5 years ago
Ross Wightman
56e2ac3a6d
Merge pull request #94 from rwightman/lr_noise
...
Learning rate noise, MobileNetV3 weights, and activate MobileNetV3/EfficientNet weight init change
5 years ago
Ross Wightman
c16f25ced2
Add MobileNetV3 Large weights, results, update README and sotabench for merge
5 years ago
Ross Wightman
c60069c1eb
Annotate types on drop fns to avoid torchscript error
5 years ago
Ross Wightman
9fee316752
Enable fixed fanout calc in EfficientNet/MobileNetV3 weight init by default. Fix #84
5 years ago
Ross Wightman
27b3680d49
Revamp LR noise, move logic to scheduler base. Fixup PlateauLRScheduler and add it as an option.
5 years ago
Ross Wightman
cc5a11abba
version bump for PyPi update
5 years ago
Ross Wightman
514b0938c4
Experimenting with per-epoch learning rate noise w/ step scheduler
5 years ago
Ross Wightman
43225d110c
Unify drop connect vs drop path under 'drop path' name, switch all EfficientNet/MobilenetV3 refs to 'drop_path'. Update factory to handle new drop args.
5 years ago
Ross Wightman
f1d5f8a6c4
Update comments for Selective Kernel and DropBlock/Path impl, add skresnet34 weights
5 years ago
Ross Wightman
670c61b28f
Some cutmix/mixup cleanup/fixes
5 years ago
Ross Wightman
b3cb5f3275
Working on CutMix impl as per #8 , integrating with Mixup, currently experimenting...
5 years ago
Ross Wightman
569419b38d
Tweak some comments, add SKNet models with weights to sotabench, remove an unused branch
5 years ago
Ross Wightman
53c47479c4
Batch validation batch size adjustment, tweak L2 crop pct
5 years ago
Ross Wightman
08553e16b3
Merge branch 'master' into attention
5 years ago
Ross Wightman
fa38f24967
Update SK network configs, add weights for skresnet8 and skresnext50
5 years ago
Ross Wightman
f098fda2ca
Add map_location='cpu' to ModelEma resume, should improve #72
5 years ago
Ross Wightman
ba15ca47e8
Add ported EfficientNet-L2, B0-B7 NoisyStudent weights from TF TPU
5 years ago
Ross Wightman
5e6dbbaf30
Add CBAM for experimentation
5 years ago
Ross Wightman
d725991870
Remove debug print from ECA module
5 years ago
Ross Wightman
2a7d256fd5
Re-enable mem-efficient/jit activations after torchscript tests
5 years ago
Ross Wightman
f902bcd54c
Layer refactoring continues, ResNet downsample rewrite for proper dilation in 3x3 and avg_pool cases
...
* select_conv2d -> create_conv2d
* added create_attn to create attention module from string/bool/module
* factor padding helpers into own file, use in both conv2d_same and avg_pool2d_same
* add some more test eca resnet variants
* minor tweaks, naming, comments, consistency
5 years ago
Ross Wightman
a99ec4e7d1
A bunch more layer reorg, splitting many layers into own files. Improve torchscript compatibility.
5 years ago
Ross Wightman
13746a33fc
Big move, layer modules and fn to timm/models/layers
5 years ago
Ross Wightman
f54612f648
Merge branch 'select_kernel' into attention
5 years ago
Ross Wightman
4defbbbaa8
Fix module name mistake, start layers sub-package
5 years ago
Ross Wightman
7011cd0902
A little bit of ECA cleanup
5 years ago
Ross Wightman
46471df7b2
Merge pull request #82 from VRandme/eca
...
ECA-Net Efficient Channel Attention
5 years ago
Ross Wightman
d0eb59ef46
Remove unused default_init for EfficientNets, experimenting with fanout calc for #84
5 years ago
Chris Ha
e6a762346a
Implement Adaptive Kernel selection
...
When channel size is given,
calculate adaptive kernel size according to original paper.
Otherwise use the given kernel size(k_size), which defaults to 3
5 years ago
Ross Wightman
13e8da2b46
SelectKernel split_input works best when input channels split like grouped conv, but output is full width. Disable zero_init for SK nets, seems a bad combo.
5 years ago
Chris Ha
6db087a1ff
Merge remote-tracking branch 'upstream/master' into eca
5 years ago
Chris Ha
904c618040
Update EcaModule.py
...
Make pylint happy
(commas, unused imports, missed imports)
5 years ago
Chris Ha
db91ba053b
EcaModule(CamelCase)
...
CamelCased EcaModule.
Renamed all instances of ecalayer to EcaModule.
eca_module.py->EcaModule.py
5 years ago
Ross Wightman
5c4991a088
Add PyTorch trained EfficientNet-ES weights from Andrew Lavin
5 years ago
Chris Ha
d04ff95eda
Merge branch 'master' into eca
5 years ago
Chris Ha
d63ae121d5
Clean up eca_module code
...
functionally similar
adjusted rwightman's version of reshaping and viewing.
Use F.pad for circular eca version for cleaner code
5 years ago
Ross Wightman
d66819d1f3
Indentation mistake. Fixes #81
5 years ago
Chris Ha
f87fcd7e88
Implement Eca modules
...
implement ECA module by
1. adopting original eca_module.py into models folder
2. adding use_eca layer besides every instance of SE layer
5 years ago
Ross Wightman
4808b3c32f
Bump version for PyPi update, fix few out of date README items/mistakes, add README updates for TF EfficientNet-B8 (RandAugment)
5 years ago
Ross Wightman
7d07ebb660
Adding some configs to sknet, incl ResNet50 variants from 'Compounding ... Assembled Techniques' paper and original SKNet50
5 years ago
Ross Wightman
a9d2424fd1
Add separate zero_init_last_bn function to support more block variety without a mess
5 years ago
Ross Wightman
355aa152d5
Just leave it float for now, will look at fp16 later. Remove unused reference code.
5 years ago
Ross Wightman
ef457555d3
BlockDrop working on GPU
5 years ago
Ross Wightman
3ff19079f9
Missed nn_ops.py from last commit
5 years ago
Ross Wightman
9f11b4e8a2
Add ConvBnAct layer to parallel integrated SelectKernelConv, add support for DropPath and DropBlock to ResNet base and SK blocks
5 years ago
Ross Wightman
cefc9b7761
Move SelectKernelConv to conv2d_layers and more
...
* always apply attention in SelectKernelConv, leave MixedConv for no attention alternative
* make MixedConv torchscript compatible
* refactor first/previous dilation name to make more sense in ResNet* networks
5 years ago
Ross Wightman
9abe610931
Used wrong channel var for split
5 years ago
Ross Wightman
58e28dc7e7
Move Selective Kernel blocks/convs to their own sknet.py file
5 years ago
Ross Wightman
a93bae6dc5
A SelectiveKernelBasicBlock for more experiments
5 years ago
Ross Wightman
ad087b4b17
Missed bias=False in selection conv
5 years ago
Ross Wightman
c8b3d6b81a
Initial impl of Selective Kernel Networks. Very much a WIP.
5 years ago
Ross Wightman
1daa303744
Add support to Dataset for class id mapping file, clean up a bit of old logic. Add results file arg for validation and update script.
5 years ago
Ross Wightman
91534522f9
Add newly added TF ported EfficientNet-B8 weights (RandAugment)
5 years ago
Ross Wightman
12dbc74742
New ResNet50 JSD + RandAugment weights
5 years ago
Ross Wightman
2f41905ba5
Update ResNet50 weights to AugMix trained 78.994 top-1. A few comments re 'tiered_narrow' tn variant.
5 years ago
Ross Wightman
d9a6a9d0af
Merge pull request #74 from rwightman/augmix-jsd
...
AugMix, JSD loss, SplitBatchNorm (Auxiliary BN), and more
5 years ago
Ross Wightman
3eb4a96eda
Update AugMix, JSD, etc comments and references
5 years ago
Ross Wightman
a28117ea46
Add tiered narrow ResNet (tn) and weights for seresnext26tn_32x4d
5 years ago
Ross Wightman
833066b540
A few minor things in SplitBN
5 years ago
Ross Wightman
7547119891
Add SplitBatchNorm. AugMix, Rand/AutoAugment, Split (Aux) BatchNorm, Jensen-Shannon Divergence, RandomErasing all working together
5 years ago
Ross Wightman
2e955cfd0c
Update RandomErasing with some improved arg names, tweak to aspect range
5 years ago
Ross Wightman
3cc0f91e23
Fix augmix variable name scope overlap, default non-blended mode
5 years ago
Ross Wightman
ec0dd4053a
Add updated RandAugment trained EfficientNet-B0 weights from @michaelklachko
5 years ago
Ross Wightman
40fea63ebe
Add checkpoint averaging script. Add headers, shebangs, exec perms to all scripts
5 years ago
Ross Wightman
4666cc9aed
Add --pin-mem arg to enable dataloader pin_memory (showing more benefit in some scenarios now), also add --torchscript arg to validate.py for testing models with jit.script
5 years ago
Ross Wightman
53001dd292
ResNet / Res2Net additions:
...
* ResNet torchscript compat
* output_stride arg supported to limit network stride via dilations (support for dilation added to Res2Net)
* allow activation layer to be changed via act_layer arg
5 years ago
Ross Wightman
f96b3e5e92
InceptionResNetV2 torchscript compatible
5 years ago
Ross Wightman
19d93fe454
Add selecsls60 weights
5 years ago
Ross Wightman
0062c15fb0
Update checkpoint url with modelzoo compatible ones.
5 years ago
Ross Wightman
b5315e66b5
Streamline SelecSLS model without breaking checkpoint compat. Move cfg handling out of model class. Update feature/pooling behaviour to match current.
5 years ago
Ross Wightman
d59a756c16
Run PyCharm autoformat on selecsls and change mix cap variables and model names to all lower
5 years ago
Ross Wightman
fb3a0f4bb8
Merge pull request #65 from mehtadushy/selecsls
...
Incorporate SelecSLS Models
5 years ago