Ross Wightman
51f488b7f5
Update results.csv files with latest weights
3 years ago
Thomas Viehmann
f805ba86d9
use .unbind instead of explicitly listing the indices
3 years ago
Ross Wightman
57992509f9
Fix some formatting in utils/model.py
3 years ago
Ross Wightman
0fe4fd3f1f
add d8 and e8 regnetz models with group size 8
3 years ago
Ross Wightman
25e7c8c5e5
Update broken resnetv2_50 weight url, add resnetv1_101 a1h recipe weights for 224x224 train
3 years ago
Ross Wightman
f7325c7b71
Support either deepspeed or fvcore for flop profiling
3 years ago
Ross Wightman
66253790d4
Add `--bench profile` mode for benchmark.py to just run deepspeed detailed profile on model
3 years ago
Ross Wightman
13a8bf7972
Add train size override and deepspeed GMACs counter (if deepspeed installed) to benchmark.py
3 years ago
Ross Wightman
0ba73e6bcb
Update README.md
3 years ago
Ross Wightman
b6caa356d2
Fixed eca_botnext26ts_256 weights added, 79.27
3 years ago
Ross Wightman
c02334d9fa
Add weights for regnetz_d and haloregnetz_c, update regnetz_c weights. Add commented PyTorch XLA code for halo attention
3 years ago
Ross Wightman
02daf2ab94
Add option to include relative pos embedding in the attention scaling as per references. See discussion #912
3 years ago
Ross Wightman
2c33ca6d8c
Merge pull request #913 from ground0state/master
...
Fix bugs that Mixup does not work when device is cpu
3 years ago
masafumi
047a5ec05f
Fix bugs that Mixup does not work device=cpu
3 years ago
Ross Wightman
cd34913278
Remove some outdated comments, botnet networks working great now.
3 years ago
Ross Wightman
6ed4cdccca
Update lambda_resnet26t weights with better set
3 years ago
Ross Wightman
288ece0e9f
Merge pull request #910 from tmp-iclr/master
...
Add ConvMixer
3 years ago
ICLR Author
44d6d51668
Add ConvMixer
3 years ago
Ross Wightman
a85df34993
Update lambda_resnet26rpt weights to 78.9, add better halonet26t weights at 79.1 with tweak to attention dim
3 years ago
Ross Wightman
38804c721b
Checkpoint clean fn useable stand alone
3 years ago
Ross Wightman
b544ad4d3f
regnetz model default cfg tweaks
3 years ago
Ross Wightman
d80653cb99
Merge branch 'alexander-soare-freeze-functionality'
3 years ago
Ross Wightman
e5da481073
Small post-merge tweak for freeze/unfreeze, add to __init__ for utils
3 years ago
Ross Wightman
5ca72dcc75
Merge branch 'freeze-functionality' of https://github.com/alexander-soare/pytorch-image-models into alexander-soare-freeze-functionality
3 years ago
Ross Wightman
e2b8d44ff0
Halo, bottleneck attn, lambda layer additions and cleanup along w/ experimental model defs
...
* align interfaces of halo, bottleneck attn and lambda layer
* add qk_ratio to all of above, control q/k dim relative to output dim
* add experimental haloregnetz, and trionet (lambda + halo + bottle) models
3 years ago
Ross Wightman
e0b3a3fab3
Make test-pooling flag for validate.py opt in
3 years ago
Alexander Soare
431e60c83f
Add acknowledgements for freeze_batch_norm inspiration
3 years ago
Ross Wightman
fbf59c04ee
Change crop ratio on correct resnet50 variant.
3 years ago
Ross Wightman
ae1ff5792f
Clean a1/a2/3 rsb _0 checkpoints properly, fix v2 loading.
3 years ago
Ross Wightman
d123042605
Update README.md
3 years ago
Ross Wightman
cd638d50a5
Merge pull request #880 from rwightman/fixes_bce_regnet
...
A collection of fixes, model experiments, etc
3 years ago
Ross Wightman
93901e992f
Version bump to 0.5.0 for pending release post RSB and ATTN updates
3 years ago
Ross Wightman
da0d39bedd
Update default crop_pct for byoanet
3 years ago
Ross Wightman
cc9bedf373
Add initial ResNet Strikes Back weights for ResNet50 and ResNetV2-50 models
3 years ago
Ross Wightman
64495505b7
Add updated lambda resnet26 and botnet26 checkpoints with fixes applied
3 years ago
Ross Wightman
b2094f4ee8
support bits checkpoints in avg/load
3 years ago
Ross Wightman
007bc39323
Some halo and bottleneck attn code cleanup, add halonet50ts weights, use optimal crop ratios
3 years ago
Alexander Soare
6d2acec1bb
Fix ordering of tests
3 years ago
Alexander Soare
65c3d78b96
Freeze unfreeze functionality finalized. Tests added
3 years ago
Alexander Soare
0cb8ea432c
wip
3 years ago
Ross Wightman
d9abfa48df
Make broadcast_buffers disable its own flag for now (needs more testing on interaction with dist_bn)
3 years ago
Ross Wightman
b1c2e3eb92
Match rel_pos_indices attr rename in conv branch
3 years ago
Ross Wightman
b49630a138
Add relative pos embed option to LambdaLayer, fix last transpose/reshape.
3 years ago
Ross Wightman
d657e2cc0b
Remove dead code line from efficientnet
3 years ago
Ross Wightman
0ca687f224
Make 'regnetz' model experiments closer to actual RegNetZ, bottleneck expansion, expand from in_chs, no shortcut on stride 2, tweak model sizes
3 years ago
Ross Wightman
b5bf4dce98
Merge pull request #898 from leondgarse/master
...
Remove a duplicate layer creation in byobnet.py
3 years ago
leondgarse
51eaf9360d
Remove a duplicate layer creation in byobnet.py
...
`self.conv2_kxk` is repeated in `byobnet.py`. Remove the duplicate code.
3 years ago
Ross Wightman
b81e79aae9
Fix bottleneck attn transpose typo, hopefully these train better now..
3 years ago
Ross Wightman
80075b0b8a
Add worker_seeding arg to allow selecting old vs updated data loader worker seed for (old) experiment repeatability
3 years ago
Ross Wightman
6478bcd02c
Fix regnetz_d conv layer name, use inception mean/std
3 years ago