Ross Wightman
cfa414cad2
Matching two bits_and_tpu changes for TFDs wrapper
...
* change 'samples' -> 'examples' for tfds wrapper to match tfds naming
* add class_to_idx for image classification datasets in tfds wrapper
3 years ago
Martins Bruveris
5220711d87
Added B/8 models to ViT.
3 years ago
Alexander Soare
0262a0e8e1
fx ready for review
3 years ago
Alexander Soare
d2994016e9
Add try/except guards
3 years ago
Alexander Soare
b25ff96768
wip - pre-rebase
3 years ago
Alexander Soare
e051dce354
Make all models FX traceable
3 years ago
Alexander Soare
cf4561ca72
Add FX based FeatureGraphNet capability
3 years ago
Alexander Soare
0149ec30d7
wip - attempting to rebase
3 years ago
Alexander Soare
02c3a75a45
wip - make it possible to use fx graph in train and eval mode
3 years ago
Alexander Soare
a6c24b936b
Tests to enforce all models FX traceable
3 years ago
Alexander Soare
bc3d4eb403
wip -rebase
3 years ago
Alexander Soare
ab3ac3f25b
Add FX based FeatureGraphNet capability
3 years ago
Ross Wightman
65419f60cc
Merge pull request #964 from rwightman/more_datasets
...
Dataset additions
3 years ago
Ross Wightman
9ec3210c2d
More TFDS parser cleanup, support improved TFDS even_split impl (on tfds-nightly only currently).
3 years ago
Ross Wightman
ba65dfe2c6
Dataset work
...
* support some torchvision datasets
* improvements to TFDS wrapper for subsplit handling (fix #942 ), shuffle seed
* add class-map support to train (fix #957 )
3 years ago
Ross Wightman
ddc29da974
Add ResNet101 and ResNet152 weights from higher aug RSB recipes. 81.93 and 82.82 top-1 at 224x224.
3 years ago
Ross Wightman
b328e56f49
Update eca_halonext26ts weights to a better set
3 years ago
Ross Wightman
2ddef942b9
Better fix for #954 that doesn't break torchscript, pull torch._assert into timm namespace when it exists
3 years ago
Ross Wightman
4f0f9cb348
Fix #954 by bringing traceable _assert into timm to allow compat w/ PyTorch < 1.8
3 years ago
Ross Wightman
a41de1f666
Add interpolation mode handling to transforms. Removes InterpolationMode warning. Works for torchvision versions w/ and w/o InterpolationMode enum. Fix #738 .
3 years ago
Ross Wightman
ed41d32637
Add repr to auto_augment and random_erasing impl
3 years ago
Ross Wightman
135a48d024
Fix sam result again for imagenetv2
3 years ago
Ross Wightman
aaff2d82d0
Add new 50ts attn models to benchmark/meta csv files
3 years ago
Ross Wightman
1e17863b7b
Fixed botne*t26 model results, add some 50ts self-attn variants
3 years ago
Ross Wightman
ae72d009fa
Add weights for lambda_resnet50ts, halo2botnet50ts, lamhalobotnet50ts, updated halonet50ts
3 years ago
Ross Wightman
13178ba73a
Add benchmark and metadata csv files
3 years ago
Ross Wightman
b745d30a3e
Fix formatting of last commit
3 years ago
Ross Wightman
3478f1d7f1
Traceability fix for vit models for some experiments
3 years ago
Ross Wightman
f658a72e72
Cleanup re-use of Dropout modules in Mlp modules after some twitter feedback :p
3 years ago
Ross Wightman
71f00bfe9e
Don't run profile if model is torchscripted
3 years ago
Ross Wightman
7da1b0b61c
Merge pull request #933 from t-vi/unbind
...
use .unbind instead of explicitly listing the indices
3 years ago
Ross Wightman
5882e62ada
Add activation count to fvcore based profiling in benchmark.py
3 years ago
Ross Wightman
51f488b7f5
Update results.csv files with latest weights
3 years ago
Thomas Viehmann
f805ba86d9
use .unbind instead of explicitly listing the indices
3 years ago
Ross Wightman
57992509f9
Fix some formatting in utils/model.py
3 years ago
Ross Wightman
0fe4fd3f1f
add d8 and e8 regnetz models with group size 8
3 years ago
Ross Wightman
25e7c8c5e5
Update broken resnetv2_50 weight url, add resnetv1_101 a1h recipe weights for 224x224 train
3 years ago
Ross Wightman
f7325c7b71
Support either deepspeed or fvcore for flop profiling
3 years ago
Ross Wightman
66253790d4
Add `--bench profile` mode for benchmark.py to just run deepspeed detailed profile on model
3 years ago
Ross Wightman
13a8bf7972
Add train size override and deepspeed GMACs counter (if deepspeed installed) to benchmark.py
3 years ago
Ross Wightman
0ba73e6bcb
Update README.md
3 years ago
Ross Wightman
b6caa356d2
Fixed eca_botnext26ts_256 weights added, 79.27
3 years ago
Ross Wightman
c02334d9fa
Add weights for regnetz_d and haloregnetz_c, update regnetz_c weights. Add commented PyTorch XLA code for halo attention
3 years ago
Ross Wightman
02daf2ab94
Add option to include relative pos embedding in the attention scaling as per references. See discussion #912
3 years ago
Ross Wightman
2c33ca6d8c
Merge pull request #913 from ground0state/master
...
Fix bugs that Mixup does not work when device is cpu
3 years ago
masafumi
047a5ec05f
Fix bugs that Mixup does not work device=cpu
3 years ago
Ross Wightman
cd34913278
Remove some outdated comments, botnet networks working great now.
3 years ago
Ross Wightman
6ed4cdccca
Update lambda_resnet26t weights with better set
3 years ago
Ross Wightman
288ece0e9f
Merge pull request #910 from tmp-iclr/master
...
Add ConvMixer
3 years ago
ICLR Author
44d6d51668
Add ConvMixer
3 years ago