Commit Graph

484 Commits (1c9284c640503ad53b819990a4c56985609704e2)

Author SHA1 Message Date
Ross Wightman 1c9284c640 Add BeiT 'finetuned' 1k weights and pretrained 22k weights, pretraining specific (masked) model excluded for now
3 years ago
Ross Wightman f8a215cfe6 A few more crossvit tweaks, fix training w/ no_weight_decay names, add crop option for scaling, adjust default crop_pct for large img size to 1.0 for better results
3 years ago
Ross Wightman 7ab2491ab7 Better handling of crossvit for tests / forward_features, fix torchscript regression in my changes
3 years ago
Ross Wightman f1808e0970 Post crossvit merge cleanup, change model names to reflect input size, cleanup img size vs scale handling, fix tests
3 years ago
Richard Chen 9fe5798bee fix bug for reset classifier and fix for validating the dimension
3 years ago
Richard Chen 3718c5a5bd fix loading pretrained model
3 years ago
Richard Chen bb50b69a57 fix for torch script
3 years ago
Richard Chen 7ab9d4555c add crossvit
3 years ago
Ross Wightman 78933122c9 Fix silly typo
3 years ago
Ross Wightman 708d87a813 Fix ViT SAM weight compat as weights at URL changed to not use repr layer. Fix #825. Tweak optim test.
3 years ago
Ross Wightman 01cb46a9a5 Add gc_efficientnetv2_rw_t weights (global context instead of SE attn). Add TF XL weights even though the fine-tuned ones don't validate that well. Change default arg for GlobalContext to use scale (mul) mode.
3 years ago
Ross Wightman d3f7440650 Add EfficientNetV2 XL model defs
3 years ago
Ross Wightman 72b227dcf5 Merge pull request #750 from drjinying/master
3 years ago
Ross Wightman 748ab852ca Allow act_layer switch for xcit, fix in_chans for some variants
3 years ago
Ying Jin 20b2d4b69d Use bicubic interpolation in resize_pos_embed()
3 years ago
Ross Wightman d3255adf8e Merge branch 'xcit' of https://github.com/alexander-soare/pytorch-image-models into alexander-soare-xcit
3 years ago
Ross Wightman f8039c7492 Fix gc effv2 model cfg name
3 years ago
Alexander Soare 3a55a30ed1 add notes from author
3 years ago
Alexander Soare 899cf84ccc bug fix - missing _dist postfix for many of the 224_dist models
3 years ago
Alexander Soare 623e8b8eb8 wip xcit
3 years ago
Ross Wightman 392368e210 Add efficientnetv2_rw_t defs w/ weights, and gc variant, as well as gcresnet26ts for experiments. Version 0.4.13
3 years ago
Ross Wightman 6d8272e92c Add SAM pretrained model defs/weights for ViT B16 and B32 models.
3 years ago
Ross Wightman ee4d8fc69a Remove unnecessary line from nest post refactor
3 years ago
Ross Wightman 8165cacd82 Realized LayerNorm2d won't work in all cases as is, fixed.
3 years ago
Ross Wightman 81cd6863c8 Move aggregation (convpool) for nest into NestLevel, cleanup and enable features_only use. Finalize weight url.
3 years ago
Ross Wightman 6ae0ac6420 Merge branch 'nested_transformer' of https://github.com/alexander-soare/pytorch-image-models into alexander-soare-nested_transformer
3 years ago
Alexander Soare 7b8a0017f1 wip to review
3 years ago
Alexander Soare b11d949a06 wip checkpoint with some feature extraction work
3 years ago
Alexander Soare 23bb72ce5e nested_transformer wip
3 years ago
Ross Wightman 766b4d3262 Fix features for resnetv2_50t
3 years ago
Ross Wightman e8045e712f Fix BatchNorm for ResNetV2 non GN models, add more ResNetV2 model defs for future experimentation, fix zero_init of last residual for pre-act.
3 years ago
Ross Wightman 20a2be14c3 Add gMLP-S weights, 79.6 top-1
3 years ago
Ross Wightman 85f894e03d Fix ViT in21k representation (pre_logits) layer handling across old and new npz checkpoints
3 years ago
Ross Wightman b41cffaa93 Fix a few issues loading pretrained vit/bit npz weights w/ num_classes=0 __init__ arg. Missed a few other small classifier handling detail on Mlp, GhostNet, Levit. Should fix #713
3 years ago
Ross Wightman 9c9755a808 AugReg release
3 years ago
Ross Wightman 381b279785 Add hybrid model fwds back
3 years ago
Ross Wightman 26f04a8e3e Fix a weight link
3 years ago
Ross Wightman 8f4a0222ed Add GMixer-24 MLP model weights, trained w/ TPU + PyTorch XLA
3 years ago
Ross Wightman b319eb5b5d Update ViT weights, more details to be added before merge.
3 years ago
Ross Wightman 8257b86550 Fix up resnetv2 bit/bitm model default res
3 years ago
Ross Wightman 1228f5a3d8 Add BiT distilled 50x1 and teacher 152x2 models from 'A good teacher is patient and consistent' paper.
3 years ago
Ross Wightman 511a8e8c96 Add official ResMLP weights.
3 years ago
Ross Wightman b9cfb64412 Support npz custom load for vision transformer hybrid models. Add posembed rescale for npz load.
3 years ago
Ross Wightman 8319e0c373 Add file docstring to std_conv.py
3 years ago
Ross Wightman 4d96165989 Merge branch 'master' into cleanup_xla_model_fixes
3 years ago
Ross Wightman 8880f696b6 Refactoring, cleanup, improved test coverage.
3 years ago
Ross Wightman ba2ca4b464 One codepath for stdconv, switch layernorm to batchnorm so gain included. Tweak epsilon values for nfnet, resnetv2, vit hybrid.
3 years ago
Ross Wightman b7a568f065 Fix torchscript issue in bat
3 years ago
Ross Wightman d17b374f0f Minimum input_size needed to be higher
3 years ago
Ross Wightman b3b90d944d Add min_input_size to bat_resnext to prevent test breakage.
3 years ago