Alexander Soare
0149ec30d7
wip - attempting to rebase
3 years ago
Alexander Soare
02c3a75a45
wip - make it possible to use fx graph in train and eval mode
3 years ago
Alexander Soare
a6c24b936b
Tests to enforce all models FX traceable
3 years ago
Alexander Soare
bc3d4eb403
wip -rebase
3 years ago
Alexander Soare
ab3ac3f25b
Add FX based FeatureGraphNet capability
3 years ago
Ross Wightman
d9b0b3d60f
device arg wasn't removed from PrefetcherCuda instantiation of RE
3 years ago
Ross Wightman
80ca078aed
Fix a few bugs and formatting/naming issues
...
* Pass optimizer resume flag through to checkpoint / updater restore. Related to #961 but not clear how relates to crash.
* Rename monitor step args, cleanup handling of step_end_idx vs num_steps for consistent log output in either case
* Resume from proper epoch (ie next epoch relative to checkpoint)
3 years ago
Ross Wightman
65419f60cc
Merge pull request #964 from rwightman/more_datasets
...
Dataset additions
3 years ago
Ross Wightman
406c486ba2
Merge remote-tracking branch 'origin/more_datasets' into bits_and_tpu
3 years ago
Ross Wightman
07693f81b0
Validation fix since we don't have multi-GPU DataParallel support yet
3 years ago
Ross Wightman
9ec3210c2d
More TFDS parser cleanup, support improved TFDS even_split impl (on tfds-nightly only currently).
3 years ago
Ross Wightman
ba65dfe2c6
Dataset work
...
* support some torchvision datasets
* improvements to TFDS wrapper for subsplit handling (fix #942 ), shuffle seed
* add class-map support to train (fix #957 )
3 years ago
Ross Wightman
59a3409182
Update README.md
3 years ago
Ross Wightman
ddc29da974
Add ResNet101 and ResNet152 weights from higher aug RSB recipes. 81.93 and 82.82 top-1 at 224x224.
3 years ago
Ross Wightman
b328e56f49
Update eca_halonext26ts weights to a better set
3 years ago
Ross Wightman
2ddef942b9
Better fix for #954 that doesn't break torchscript, pull torch._assert into timm namespace when it exists
3 years ago
Ross Wightman
4f0f9cb348
Fix #954 by bringing traceable _assert into timm to allow compat w/ PyTorch < 1.8
3 years ago
Ross Wightman
a41de1f666
Add interpolation mode handling to transforms. Removes InterpolationMode warning. Works for torchvision versions w/ and w/o InterpolationMode enum. Fix #738 .
3 years ago
Ross Wightman
ed41d32637
Add repr to auto_augment and random_erasing impl
3 years ago
Ross Wightman
135a48d024
Fix sam result again for imagenetv2
3 years ago
Ross Wightman
aaff2d82d0
Add new 50ts attn models to benchmark/meta csv files
3 years ago
Ross Wightman
1e17863b7b
Fixed botne*t26 model results, add some 50ts self-attn variants
3 years ago
Ross Wightman
ae72d009fa
Add weights for lambda_resnet50ts, halo2botnet50ts, lamhalobotnet50ts, updated halonet50ts
3 years ago
Ross Wightman
a45186a6e8
Merge remote-tracking branch 'origin/master' into bits_and_tpu
3 years ago
Ross Wightman
13178ba73a
Add benchmark and metadata csv files
3 years ago
Ross Wightman
b745d30a3e
Fix formatting of last commit
3 years ago
Ross Wightman
3478f1d7f1
Traceability fix for vit models for some experiments
3 years ago
Ross Wightman
f658a72e72
Cleanup re-use of Dropout modules in Mlp modules after some twitter feedback :p
3 years ago
Ross Wightman
71f00bfe9e
Don't run profile if model is torchscripted
3 years ago
Ross Wightman
7da1b0b61c
Merge pull request #933 from t-vi/unbind
...
use .unbind instead of explicitly listing the indices
3 years ago
Ross Wightman
5882e62ada
Add activation count to fvcore based profiling in benchmark.py
3 years ago
Ross Wightman
51f488b7f5
Update results.csv files with latest weights
3 years ago
Thomas Viehmann
f805ba86d9
use .unbind instead of explicitly listing the indices
3 years ago
Ross Wightman
57992509f9
Fix some formatting in utils/model.py
3 years ago
Ross Wightman
0fe4fd3f1f
add d8 and e8 regnetz models with group size 8
3 years ago
Ross Wightman
690f31d02d
Post merge cleanup, restore previous unwrap fn
3 years ago
Ross Wightman
3b6ba76126
Merge remote-tracking branch 'origin/master' into bits_and_tpu
3 years ago
Ross Wightman
25e7c8c5e5
Update broken resnetv2_50 weight url, add resnetv1_101 a1h recipe weights for 224x224 train
3 years ago
Ross Wightman
f7325c7b71
Support either deepspeed or fvcore for flop profiling
3 years ago
Ross Wightman
66253790d4
Add `--bench profile` mode for benchmark.py to just run deepspeed detailed profile on model
3 years ago
Ross Wightman
13a8bf7972
Add train size override and deepspeed GMACs counter (if deepspeed installed) to benchmark.py
3 years ago
Ross Wightman
0ba73e6bcb
Update README.md
3 years ago
Ross Wightman
b6caa356d2
Fixed eca_botnext26ts_256 weights added, 79.27
3 years ago
Ross Wightman
c02334d9fa
Add weights for regnetz_d and haloregnetz_c, update regnetz_c weights. Add commented PyTorch XLA code for halo attention
3 years ago
Ross Wightman
02daf2ab94
Add option to include relative pos embedding in the attention scaling as per references. See discussion #912
3 years ago
Ross Wightman
2c33ca6d8c
Merge pull request #913 from ground0state/master
...
Fix bugs that Mixup does not work when device is cpu
3 years ago
masafumi
047a5ec05f
Fix bugs that Mixup does not work device=cpu
3 years ago
Ross Wightman
cd34913278
Remove some outdated comments, botnet networks working great now.
3 years ago
Ross Wightman
6ed4cdccca
Update lambda_resnet26t weights with better set
3 years ago
Ross Wightman
288ece0e9f
Merge pull request #910 from tmp-iclr/master
...
Add ConvMixer
3 years ago