Ross Wightman
5895056dc4
Add openai b32 ft
2 years ago
Ross Wightman
9dea5143d5
Adding more clip ft variants
2 years ago
Ross Wightman
444dcba4ad
CLIP B16 12k weights added
2 years ago
Ross Wightman
dff4717cbf
Add clip b16 384x384 finetunes
2 years ago
Ross Wightman
883fa2eeaa
Add fine-tuned B/16 224x224 in1k clip models
2 years ago
Ross Wightman
9a3d2ac2d5
Add latest CLIP ViT fine-tune pretrained configs / model entrypt updates
2 years ago
Ross Wightman
42bbbddee9
Add missing model config
2 years ago
Ross Wightman
def68befa7
Updating vit model defs for multi-weight support trial (vit first). Prepping for CLIP (laion2b and openai) fine-tuned weights.
2 years ago
Ross Wightman
0dadb4a6e9
Initial multi-weight support, handled so old pretrained config handling co-exists with new tags.
2 years ago
hongxin xiang
653bdc7105
Fix comment: https://github.com/rwightman/pytorch-image-models/pull/1564#issuecomment-1326743424
2 years ago
hongxin xiang
bdc9fad638
Fix compatibility bug: QMNIST and ImageNet datasets do not exist in torchvision 0.10.1.
2 years ago
Wauplin
9b114754db
refactor push_to_hub helper
2 years ago
Wauplin
ae0a0db7de
Create repo before cloning with Repository.clone_from
2 years ago
Ross Wightman
803254bb40
Fix spacing misalignment for fast norm path in LayerNorm modules
2 years ago
Ross Wightman
475ecdfa3d
cast env var args for dataset readers to int
2 years ago
Hoan Nguyen
39190f5f44
Remove inplace operators when calculating the loss
...
Remove inplace operators to overcome the following error when using `asymmetric_loss`
```
RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation
```
2 years ago
Ross Wightman
6635bc3f7d
Merge pull request #1479 from rwightman/script_cleanup
...
Train / val script enhancements, non-GPU (ie CPU) device support, HF datasets support, TFDS/WDS dataloading improvements
2 years ago
Ross Wightman
0e6023f032
Merge pull request #1381 from ChristophReich1996/master
...
Fix typo in PositionalEncodingFourier
2 years ago
Ross Wightman
66f4af7090
Merge remote-tracking branch 'origin/master' into script_cleanup
2 years ago
Ross Wightman
d3961536c9
comment some debug logs for WDS dataset
2 years ago
Ross Wightman
e9dccc918c
Rename dataset/parsers -> dataset/readers, create_parser to create_reader, etc
2 years ago
Ross Wightman
8c28363dc9
Version 0.7.dev0 for master
2 years ago
nateraw
30bafd7347
🔖 add dev suffix to version tag
2 years ago
Ross Wightman
f67a7ee8bd
Set num_workers in Iterable WDS/TFDS datasets early so sample estimate is correct
2 years ago
Ross Wightman
cea8df3d0c
Version 0.6.12
2 years ago
Ross Wightman
9914f744dc
Add more maxxvit weights including ConvNeXt conv block based experiments.
2 years ago
Ross Wightman
b1b024dfed
Scheduler update, add v2 factory method, support scheduling on updates instead of just epochs. Add LR to summary csv. Add lr_base scaling calculations to train script. Fix #1168
2 years ago
Ross Wightman
4f18d6dc5f
Fix logs in WDS parser
2 years ago
Mohamed Rashad
8fda68aff6
Fix repo id bug
...
This fixes issue #1482
2 years ago
Ross Wightman
b8c8550841
Data improvements. Improve train support for in_chans != 3. Add wds dataset support from bits_and_tpu branch w/ fixes and tweaks. TFDS tweaks.
2 years ago
Alex Fafard
7327792f39
update to support pickle based dictionaries
2 years ago
Ross Wightman
1199c5a1a4
clip_laion2b models need 1e-5 eps for LayerNorm
2 years ago
Ross Wightman
87939e6fab
Refactor device handling in scripts, distributed init to be less 'cuda' centric. More device args passed through where needed.
2 years ago
Ross Wightman
c88947ad3d
Add initial Hugging Face Datasets parser impl.
2 years ago
Ross Wightman
e858912e0c
Add brute-force checkpoint remapping option
2 years ago
Ross Wightman
b293dfa595
Add CL SE module
2 years ago
Ross Wightman
2a296412be
Add Adan optimizer
2 years ago
Ross Wightman
5dc4343308
version 0.6.11
2 years ago
Ross Wightman
a383ef99f5
Make huggingface_hub necessary if it's the only source for a pretrained weight
2 years ago
Ross Wightman
33e30f8c8b
Remove layer-decay print
2 years ago
Ross Wightman
e069249a2d
Add hf hub entries for laion2b clip models, add huggingface_hub dependency, update some setup/reqs, torch >= 1.7
2 years ago
Ross Wightman
9d65557be3
Fix errant import
2 years ago
Ross Wightman
9709dbaaa9
Adding support for fine-tune CLIP LAION-2B image tower weights for B/32, L/14, H/14 and g/14. Still WIP
2 years ago
Ross Wightman
a520da9b49
Update tresnet features_info for v2
2 years ago
Ross Wightman
c8ab747bf4
BEiT-V2 checkpoints didn't remove 'module' from weights, adapt checkpoint filter
2 years ago
Ross Wightman
73049dc2aa
Fix typo in dla weight update
2 years ago
Ross Wightman
3599c7e6a4
version 0.6.10
2 years ago
Ross Wightman
e11efa872d
Update a bunch of weights with external links to timm release assets. Fixes issue with *aliyuncs.com returning forbidden. Did pickle scan / verify and re-hash. Add TresNet-V2-L weights.
2 years ago
Ross Wightman
fa8c84eede
Update maxvit_tiny_256 weight to better iter, add coatnet / maxvit / maxxvit model defs for future runs
2 years ago
Ross Wightman
c1b3cea19d
Add maxvit_rmlp_tiny_rw_256 model def and weights w/ 84.2 top-1 @ 256, 84.8 @ 320
2 years ago
Ross Wightman
914544fc81
Add beitv2 224x224 checkpoints from https://github.com/microsoft/unilm/tree/master/beit2
2 years ago
Ross Wightman
dc90816f26
Add `maxvit_tiny_rw_224` weights 83.5 @ 224 and `maxvit_rmlp_pico_rw_256` relpos weights, 80.5 @ 256, 81.3 @ 320
2 years ago
Ross Wightman
f489f02ad1
Make gcvit window size ratio based to improve resolution changing support #1449. Change default init to original.
2 years ago
Ross Wightman
7f1b223c02
Add maxvit_rmlp_nano_rw_256 model def & weights, make window/grid size dynamic wrt img_size by default
2 years ago
Ross Wightman
e6a4361306
pretrained_cfg entry for mvitv2_small_cls
2 years ago
Ross Wightman
f66e5f0e35
Fix class token support in MViT-V2, add small_class variant to ensure it's tested. Fix #1443
2 years ago
Ross Wightman
f1d2160d85
Update a few maxxvit comments, rename PartitionAttention -> PartitionAttentionCl for consistency with other blocks
2 years ago
Ross Wightman
eca6f0a25c
Fix syntax error (extra dataclass comma) in maxxvit.py
2 years ago
Ross Wightman
ff6a919cf5
Add --fast-norm arg to benchmark.py, train.py, validate.py
2 years ago
Ross Wightman
769ab4b98a
Clean up no_grad for trunc normal weight inits
2 years ago
Ross Wightman
48e1df8b37
Add norm/norm_act header comments
2 years ago
Ross Wightman
7c2660576d
Tweak init for convnext block using maxxvit/coatnext.
2 years ago
Ross Wightman
1d8d6f6072
Fix two default args in DenseNet blocks... fix #1427
2 years ago
Ross Wightman
527f9a4cb2
Updated to correct maxvit_nano weights...
2 years ago
Ross Wightman
b2e8426fca
Make k=stride=2 ('avg2') pooling default for coatnet/maxvit. Add weight links. Rename 'combined' partition to 'parallel'.
2 years ago
Ross Wightman
837c68263b
For ConvNeXt, use timm internal LayerNorm for fast_norm in non conv_mlp mode
2 years ago
Ross Wightman
cac0a4570a
More test fixes, pool size for 256x256 maxvit models
2 years ago
Ross Wightman
e939ed19b9
Rename internal creation fn for maxvit, has not been just coatnet for a while...
2 years ago
Ross Wightman
ffaf97f813
MaxxVit! A very configurable MaxVit and CoAtNet impl with lots of goodies..
2 years ago
Ross Wightman
8c9696c9df
More model and test fixes
2 years ago
Ross Wightman
ca52108c2b
Fix some model support functions
2 years ago
Ross Wightman
f332fc2db7
Fix some test failures, torchscript issues
2 years ago
Ross Wightman
6e559e9b5f
Add MViT (Multi-Scale) V2
2 years ago
Ross Wightman
43aa84e861
Add 'fast' layer norm that doesn't cast to float32, support APEX LN impl for slight speed gain, update norm and act factories, tweak SE for ability to disable bias (needed by GCVit)
2 years ago
Ross Wightman
c486aa71f8
Add GCViT
2 years ago
Ross Wightman
fba6ecd39b
Add EfficientFormer
2 years ago
Ross Wightman
ff4a38e2c3
Add PyramidVisionTransformerV2
2 years ago
Ross Wightman
1d8ada359a
Add timm ConvNeXt 'atto' weights, change test resolution for FB ConvNeXt 224x224 weights, add support for different dw kernel_size
2 years ago
Ross Wightman
2544d3b80f
ConvNeXt pico, femto, and nano, plus pico and femto ols (overlapping stem), weights and model defs
2 years ago
Ross Wightman
13565aad50
Add edgenext_base model def & weight link, update to improve ONNX export #1385
2 years ago
Ross Wightman
8ad4bdfa06
Allow ntuple to be used with string values
2 years ago
Christoph Reich
faae93e62d
Fix typo in PositionalEncodingFourier
2 years ago
Ross Wightman
7430a85d07
Update README, bump version to 0.6.8
2 years ago
Ross Wightman
ec6a28830f
Add DeiT-III 'medium' model defs and weights
2 years ago
Ross Wightman
d875a1d3f6
version 0.6.7
2 years ago
Ross Wightman
6f103a442b
Add convnext_nano weights, 80.8 @ 224, 81.5 @ 288
2 years ago
Ross Wightman
4042a94f8f
Add weights for two 'Edge' block (3x3->1x1) variants of CS3 networks.
2 years ago
Ross Wightman
c8f69e04a9
Merge pull request #1365 from veritable-tech/fix-resize-pos-embed
...
Take `no_emb_class` into account when calling `resize_pos_embed`
2 years ago
Ceshine Lee
0b64117592
Take `no_emb_class` into account when calling `resize_pos_embed`
2 years ago
Jasha10
56c3a84db3
Update type hint for `register_notrace_module`
...
register_notrace_module is used to decorate types (i.e. subclasses of nn.Module).
It is not called on module instances.
2 years ago
Ross Wightman
1b278136c3
Change models with mean 0,0,0 std 1,1,1 from int to float for consistency as mentioned in #1355
2 years ago
Ross Wightman
909705e7ff
Remove some redundant requires_grad=True from nn.Parameter in third party code
2 years ago
Ross Wightman
c5e0d1c700
Add dilation support to convnext, allows output_stride=8 and 16 use. Fix #1341
2 years ago
Ross Wightman
dc376e3676
Ensure all model entrypoint fn default to `pretrained=False` (a few didn't)
2 years ago
Ross Wightman
23b102064a
Add cs3sedarknet_x weights w/ 82.65 @ 288 top1. Add 2 cs3 edgenet models (w/ 3x3-1x1 block), remove aa from cspnet blocks (not needed)
2 years ago
Ross Wightman
0dbd9352ce
Add bulk_runner script and updates to benchmark.py and validate.py for better error handling in bulk runs (used for benchmark and validation result runs). Improved batch size decay stepping on retry...
2 years ago
Ross Wightman
92b91af3bb
version 0.6.6
2 years ago
Ross Wightman
05313940e2
Add cs3darknet_x, cs3sedarknet_l, and darknetaa53 weights from TPU sessions. Move SE btwn conv1 & conv2 in DarkBlock. Improve SE/attn handling in Csp/DarkNet. Fix leaky_relu bug on older csp models.
2 years ago
nateraw
51cca82aa1
👽 use hf_hub_download instead of cached_download
2 years ago
Ross Wightman
324a4e58b6
disable nvfuser for jit te/legacy modes (for PT 1.12+)
2 years ago