Ross Wightman | 515121cca1 | Use reshape instead of view in std_conv, causing issues in recent PyTorch in channels_last | 3 years ago
Ross Wightman | 8319e0c373 | Add file docstring to std_conv.py | 4 years ago
Ross Wightman | ba2ca4b464 | One codepath for stdconv, switch layernorm to batchnorm so gain included. Tweak epsilon values for nfnet, resnetv2, vit hybrid. | 4 years ago
Ross Wightman | 8e4ac3549f | All ScaledStdConv and StdConv uses default to using F.layernorm so that they work with PyTorch XLA. eps value tweaking is a WIP. | 4 years ago
Ross Wightman | 740f32c96a | Add ECA-NFNet-L0 weights and update model name. Update README and bump version to 0.4.6 | 4 years ago
Ross Wightman | 8563609b28 | Update notes in ScaledStdConv impl | 4 years ago
Ross Wightman | 678ba4e0a2 | Add NFNet-F model weights ported from DeepMind Haiku impl and new set of models w/ compatible config. | 4 years ago
Ross Wightman | 90980de4a9 | Fix up a few details in NFResNet models, managed stable training. Add support for gamma gain to be applied in activation or ScaleStdConv. Some tweaks to ScaledStdConv. | 4 years ago
Ross Wightman | 5a8e1e643e | Initial Normalizer-Free Reg/ResNet impl. A bit of related layer refactoring. | 4 years ago