Ross Wightman
|
4ca52d73d8
|
Add separate set and update method to ModelEmaV2
|
4 years ago |
Ross Wightman
|
2ed8f24715
|
A few more changes for 0.3.2 maint release. Linear layer change for mobilenetv3 and inception_v3, support no bias for linear wrapper.
|
4 years ago |
Ross Wightman
|
6504a42832
|
Version 0.3.2
|
4 years ago |
Ross Wightman
|
460eba7f24
|
Work around casting issue with combination of native torch AMP and torchscript for Linear layers
|
4 years ago |
Ross Wightman
|
5f4b6076d8
|
Fix inplace arg compat for GELU and PreLU via activation factory
|
4 years ago |
Ross Wightman
|
fd962c4b4a
|
Native SiLU (Swish) op doesn't export to ONNX
|
4 years ago |
Ross Wightman
|
27bbc70d71
|
Add back old ModelEma and rename new one to ModelEmaV2 to avoid compat breaks in dependant code. Shuffle train script, add a few comments, remove DataParallel support, support experimental torchscript training.
|
4 years ago |
Ross Wightman
|
9214ca0716
|
Simplifying EMA...
|
4 years ago |
Ross Wightman
|
53aeed3499
|
ver 0.3.1
|
4 years ago |
Ross Wightman
|
30ab4a1494
|
Fix issue in optim factory with sgd / eps flag. Bump version to 0.3.1
|
4 years ago |
Ross Wightman
|
741572dc9d
|
Bump version to 0.3.0 for pending PyPi push
|
4 years ago |
Ross Wightman
|
b401952caf
|
Add newly added vision transformer large/base 224x224 weights ported from JAX official repo
|
4 years ago |
Ross Wightman
|
61200db0ab
|
in_chans=1 working w/ pretrained weights for vision_transformer
|
4 years ago |
Ross Wightman
|
e90edce438
|
Support native silu activation (aka swish). An optimized ver is available in PyTorch 1.7.
|
4 years ago |
Ross Wightman
|
da6cd2cc1f
|
Fix regression for pretrained classifier loading when using entrypt functions directly
|
4 years ago |
Ross Wightman
|
f591e90b0d
|
Make sure num_features attr is present in vit models as with others
|
4 years ago |
Ross Wightman
|
4a3df7842a
|
Fix topn metric view regression on PyTorch 1.7
|
4 years ago |
Ross Wightman
|
f944242cb0
|
Fix #262, num_classes arg mixup. Make vision_transformers a bit closer to other models wrt get/reset classfier/forward_features. Fix torchscript for ViT.
|
4 years ago |
Ross Wightman
|
736f209e7d
|
Update vision transformers to be compatible with official code. Port official ViT weights from jax impl.
|
4 years ago |
Ross Wightman
|
477a78ed81
|
Fix optimizer factory regressin for optimizers like sgd/momentum that don't have an eps arg
|
4 years ago |
Ross Wightman
|
27a93e9de7
|
Improve test crop for ViT models. Small now 77.85, added base weights at 79.35 top-1.
|
4 years ago |
Ross Wightman
|
d4db9e7977
|
Add small vision transformer weights. 77.42 top-1.
|
4 years ago |
talrid
|
27fadaa922
|
asymmetric_loss
|
4 years ago |
Ross Wightman
|
f31933cb37
|
Initial Vision Transformer impl w/ patch and hybrid variants. Refactor tuple helpers.
|
4 years ago |
Ross Wightman
|
a4d8fea61e
|
Add model based wd skip support. Improve cross version compat of optimizer factory. Fix #247
|
4 years ago |
Ross Wightman
|
80078c47bb
|
Add Adafactor and Adahessian optimizers, cleanup optimizer arg passing, add gradient clipping support.
|
4 years ago |
Ross Wightman
|
fcb6258877
|
Add missing leaky_relu layer factory defn, update Apex/Native loss scaler interfaces to support unscaled grad clipping. Bump ver to 0.2.2 for pending release.
|
4 years ago |
Ross Wightman
|
e8e2d9cabf
|
Add DropPath (stochastic depth) to ReXNet and VoVNet. RegNet DropPath impl tweak and dedupe se args.
|
4 years ago |
Ross Wightman
|
e8ca45854c
|
More models in sotabench, more control over sotabench run, dataset filename extraction consistency
|
4 years ago |
Ross Wightman
|
9c406532bd
|
Add EfficientNet-EdgeTPU-M (efficientnet_em) model trained natively in PyTorch. More sotabench fiddling.
|
4 years ago |
Ross Wightman
|
c40384f5bd
|
Add ResNet weights. 80.5 (top-1) ResNet-50-D, 77.1 ResNet-34-D, 72.7 ResNet-18-D.
|
4 years ago |
Ross Wightman
|
47a7b3b5b1
|
More flexible mixup mode, add 'half' mode.
|
4 years ago |
Ross Wightman
|
532e3b417d
|
Reorg of utils into separate modules
|
4 years ago |
Ross Wightman
|
33f8a1bf36
|
Updated README, add wide_resnet50_2 and seresnext50_32x4d weights
|
4 years ago |
Ross Wightman
|
751b0bba98
|
Add global_pool (--gp) arg changes to allow passing 'fast' easily for train/validate to avoid channels_last issue with AdaptiveAvgPool
|
4 years ago |
Ross Wightman
|
9c297ec67d
|
Cleanup Apex vs native AMP scaler state save/load. Cleanup CheckpointSaver a bit.
|
4 years ago |
Ross Wightman
|
80c9d9cc72
|
Add 'fast' global pool option, remove redundant SEModule from tresnet, normal one is now 'fast'
|
4 years ago |
Ross Wightman
|
90a01f47d1
|
hrnet features_only pretrained weight loading issue. Fix #232.
|
4 years ago |
Ross Wightman
|
110a7c4982
|
AdaptiveAvgPool2d -> mean((2,3)) for all SE/attn layers to avoid NaN with AMP + channels_last layout. See https://github.com/pytorch/pytorch/issues/43992
|
4 years ago |
Ross Wightman
|
c2cd1a332e
|
Improve torch amp support and add channels_last support for train/validate scripts
|
4 years ago |
Ross Wightman
|
470220b1f4
|
Fix MobileNetV3 crash with global_pool='', output consistent with other models but not equivalent due to efficient head.
|
4 years ago |
Ross Wightman
|
fc8b8afb6f
|
Fix a silly bug in Sample version of EvoNorm missing x* part of swish, update EvoNormBatch to accumulated unbiased variance.
|
4 years ago |
Ross Wightman
|
f614df7921
|
Bump version to 0.2.1 and update README
|
4 years ago |
Ross Wightman
|
b423bc8362
|
Merge pull request #218 from rwightman/cutmix
CutMix + MixUp overhaul
|
4 years ago |
Ross Wightman
|
8c9814e3f5
|
Final cleanup of mixup/cutmix. Element/batch modes working with both collate (prefetcher active) and without prefetcher.
|
4 years ago |
Ross Wightman
|
0f5d9d8166
|
Add CSPResNet50 weights, 79.6 top-1 at 256x256
|
4 years ago |
Ross Wightman
|
b1b6e7c361
|
Fix a few more issues related to #216 w/ TResNet (space2depth) and FP16 weights in wide resnets. Also don't completely dump pretrained weights in in_chans != 1 or 3 cases.
|
4 years ago |
Ross Wightman
|
512b2dd645
|
Add new EfficientNet-B3 and RegNetY-3.2GF weights, both just over 82 top-1
|
4 years ago |
Ross Wightman
|
6890300877
|
Add DropPath (stochastic depth) to RegNet
|
4 years ago |
Ross Wightman
|
cd23f55397
|
Fix mixed prec issues with new mixup code
|
4 years ago |