Commit Graph

460 Commits (81cd6863c8c9515de8884e8a8ea0445ec08b4486)

Author SHA1 Message Date
Ross Wightman 81cd6863c8 Move aggregation (convpool) for nest into NestLevel, cleanup and enable features_only use. Finalize weight url.
4 years ago
Ross Wightman 6ae0ac6420 Merge branch 'nested_transformer' of https://github.com/alexander-soare/pytorch-image-models into alexander-soare-nested_transformer
4 years ago
Alexander Soare 7b8a0017f1 wip to review
4 years ago
Alexander Soare b11d949a06 wip checkpoint with some feature extraction work
4 years ago
Alexander Soare 23bb72ce5e nested_transformer wip
4 years ago
Ross Wightman 766b4d3262 Fix features for resnetv2_50t
4 years ago
Ross Wightman e8045e712f Fix BatchNorm for ResNetV2 non GN models, add more ResNetV2 model defs for future experimentation, fix zero_init of last residual for pre-act.
4 years ago
Ross Wightman 20a2be14c3 Add gMLP-S weights, 79.6 top-1
4 years ago
Ross Wightman 85f894e03d Fix ViT in21k representation (pre_logits) layer handling across old and new npz checkpoints
4 years ago
Ross Wightman b41cffaa93 Fix a few issues loading pretrained vit/bit npz weights w/ num_classes=0 __init__ arg. Missed a few other small classifier handling detail on Mlp, GhostNet, Levit. Should fix #713
4 years ago
Ross Wightman 9c9755a808 AugReg release
4 years ago
Ross Wightman 381b279785 Add hybrid model fwds back
4 years ago
Ross Wightman 26f04a8e3e Fix a weight link
4 years ago
Ross Wightman 8f4a0222ed Add GMixer-24 MLP model weights, trained w/ TPU + PyTorch XLA
4 years ago
Ross Wightman b319eb5b5d Update ViT weights, more details to be added before merge.
4 years ago
Ross Wightman 8257b86550 Fix up resnetv2 bit/bitm model default res
4 years ago
Ross Wightman 1228f5a3d8 Add BiT distilled 50x1 and teacher 152x2 models from 'A good teacher is patient and consistent' paper.
4 years ago
Ross Wightman 511a8e8c96 Add official ResMLP weights.
4 years ago
Ross Wightman b9cfb64412 Support npz custom load for vision transformer hybrid models. Add posembed rescale for npz load.
4 years ago
Ross Wightman 8319e0c373 Add file docstring to std_conv.py
4 years ago
Ross Wightman 4d96165989 Merge branch 'master' into cleanup_xla_model_fixes
4 years ago
Ross Wightman 8880f696b6 Refactoring, cleanup, improved test coverage.
4 years ago
Ross Wightman ba2ca4b464 One codepath for stdconv, switch layernorm to batchnorm so gain included. Tweak epsilon values for nfnet, resnetv2, vit hybrid.
4 years ago
Ross Wightman b7a568f065 Fix torchscript issue in bat
4 years ago
Ross Wightman d17b374f0f Minimum input_size needed to be higher
4 years ago
Ross Wightman b3b90d944d Add min_input_size to bat_resnext to prevent test breakage.
4 years ago
Ross Wightman d413eef1bf Add ResMLP-24 model weights that I trained in PyTorch XLA on TPU-VM. 79.2 top-1.
4 years ago
Ross Wightman 10d8fa4620 Add gc and bat attention resnext26ts variants to byob for test.
4 years ago
Ross Wightman 2f5ed2dec1 Update `init_values` const for 24 and 36 layer ResMLP models
4 years ago
Ross Wightman 8e4ac3549f All ScaledStdConv and StdConv uses default to using F.layernorm so that they work with PyTorch XLA. eps value tweaking is a WIP.
4 years ago
Ross Wightman 2a63d0246b Post merge cleanup
4 years ago
Ross Wightman 45dec179e5
Merge pull request #681 from lmk123568/master
4 years ago
Dongyoon Han ded1671483 Fix stochastic depth working only with a shortcut
4 years ago
Mike b87d98b238
Update convit.py
4 years ago
Ross Wightman bda8ab015a Remove min channels for SelectiveKernel, divisor should cover cases well enough.
4 years ago
Ross Wightman a27f4aec4a Missed args for skresnext w/ refactoring.
4 years ago
Ross Wightman 307a935b79 Add non-local and BAT attention. Merge attn and self-attn factories into one. Add attention references to README. Add mlp 'mode' to ECA.
4 years ago
Ross Wightman 8bf63b6c6c Able to use other attn layer in EfficientNet now. Create test ECA + GC B0 configs. Make ECA more configurable.
4 years ago
Ross Wightman bcec14d3b5 Bring EfficientNet SE layer in line with others, pull se_ratio outside of blocks. Allows swapping w/ other attn layers.
4 years ago
Ross Wightman 9611458e19 Throw in some FBNetV3 code I had lying around, some refactoring of SE reduction channel calcs for all EffNet archs.
4 years ago
Ross Wightman 01b9108619 Merge branch 'master' into more_attn
4 years ago
Ross Wightman d7bab8a6c5 Fix strict flag change for checkpoint load.
4 years ago
Ross Wightman 02f9d4bc34 Add weights for resnet51q model, add 61q def.
4 years ago
Ross Wightman f615474be3 Fix broken test, repvgg block doesn't have attn_last attr.
4 years ago
Ross Wightman 742c2d5247 Add Gather-Excite and Global Context attn modules. Refactor existing SE-like attn for consistency and refactor byob/byoanet for less redundancy.
4 years ago
Ross Wightman 9c78de8c02 Fix #661, move hardswish out of default args for LeViT. Enable native torch support for hardswish, hardsigmoid, mish if present.
4 years ago
Ross Wightman 5db7452173 Fix visformer in_chans stem handling
4 years ago
Ross Wightman 11ae795e99 Redo LeViT attention bias caching in a way that works with both torchscript and DataParallel
4 years ago
Ross Wightman d400f1dbdd Filter test models before creation for backward/torchscript tests
4 years ago
Ross Wightman c4572cc5aa Add Visformer-small weighs, tweak torchscript jit test img size.
4 years ago