Ross Wightman | c53cf76fa3 | Torchscript fixes/hacks for rms_norm, refactor ParallelScalingBlock with manual combination of input projections, closer paper match | 2 years ago
data | Improve support for custom dataset label name/description through HF hub export, via pretrained_cfg | 2 years ago
layers | Torchscript fixes/hacks for rms_norm, refactor ParallelScalingBlock with manual combination of input projections, closer paper match | 2 years ago
loss | Remove inplace operators when calculating the loss | 3 years ago
models | Torchscript fixes/hacks for rms_norm, refactor ParallelScalingBlock with manual combination of input projections, closer paper match | 2 years ago
optim | Add Lion optimizer | 2 years ago
scheduler | Scheduler update, add v2 factory method, support scheduling on updates instead of just epochs. Add LR to summary csv. Add lr_base scaling calculations to train script. Fix #1168 | 3 years ago
utils | Davit update formatting and fix grad checkpointing (#7) | 3 years ago
__init__.py | Major module / path restructure, timm.models.layers -> timm.layers, add _ prefix to all non model modules in timm.models | 3 years ago
version.py | Version 0.8.11dev0 | 2 years ago