Commit Graph

843 Commits (ce5578bc3a3e2def84df79bc08e10be5f5fc7a14)

Author SHA1 Message Date
Ross Wightman 20a1fa63f8 Make dev version 0.6.2.dev0 for pypi pre
2 years ago
Ross Wightman 347308faad Update README.md, version to 0.6.2
2 years ago
Ross Wightman 4b30bae67b Add updated vit_relpos weights, and impl w/ support for official swin-v2 differences for relpos. Add bias control support for MLP layers
2 years ago
Ross Wightman d4c0588012 Remove persistent buffers from Swin-V2. Change SwinV2Cr cos attn + tau/logit_scale to match official, add ckpt convert, init_value zeros resid LN weight by default
2 years ago
Ross Wightman 27c42f0830 Fix torchscript use for offician Swin-V2, add support for non-square window/shift to WindowAttn/Block
2 years ago
Ross Wightman 2f2b22d8c7 Disable nvfuser fma / opt level overrides per #1244
2 years ago
Ross Wightman c0211b0bf7 Swin-V2 test fixes, typo
2 years ago
Ross Wightman 9a86b900fa Official SwinV2 models
2 years ago
Ross Wightman d07d015173
Merge pull request #1249 from okojoalg/sequencer
2 years ago
Ross Wightman d30685c283
Merge pull request #1251 from hankyul2/fix-multistep-scheduler
2 years ago
han a16171335b fix: change milestones to decay-milestones
2 years ago
Ross Wightman 39b725e1c9 Fix tests for rank-4 output where feature channels dim is -1 (3) and not 1
2 years ago
Ross Wightman 78a32655fa Fix poolformer group_matcher to merge proj downsample with previous block, support coarse
2 years ago
Ross Wightman d79f3d9d1e Fix torchscript use for sequencer, add group_matcher, forward_head support, minor formatting
2 years ago
Ross Wightman 37b6920df3 Fix group_matcher regex for regnet.py
2 years ago
okojoalg 93a79a3dd9 Fix num_features in Sequencer
2 years ago
han 57a988df30 fix: multistep lr decay epoch bugs
2 years ago
okojoalg 578d52e752 Add Sequencer
2 years ago
Ross Wightman f5ca4141f7 Adjust arg order for recent vit model args, add a few comments
2 years ago
Ross Wightman 41dc49a337 Vision Transformer refactoring and Rel Pos impl
2 years ago
Ross Wightman b7cb8d0337 Add Swin-V2 Small-NS weights (83.5 @ 224). Add layer scale like 'init_values' via post-norm LN weight scaling
2 years ago
jjsjann123 f88c606fcf fixing channels_last on cond_conv2d; update nvfuser debug env variable
2 years ago
Li Dong 09e9f3defb
migrate azure blob for beit checkpoints
2 years ago
Ross Wightman 52ac881402 Missed first_conv in latest seresnext 'D' default_cfgs
2 years ago
Ross Wightman 7629d8264d Add two new SE-ResNeXt101-D 32x8d weights, one anti-aliased and one not. Reshuffle default_cfgs vs model entrypoints for resnet.py so they are better aligned.
2 years ago
SeeFun 8f0bc0591e fix convnext args
2 years ago
Ross Wightman c5a8e929fb Add initial swinv2 tiny / small weights
2 years ago
Ross Wightman f670d98cb8 Make a few more layers symbolically traceable (remove from FX leaf modules)
2 years ago
SeeFun ec4e9aa5a0
Add ConvNeXt tiny and small pretrain in22k
2 years ago
Ross Wightman 575924ed60 Update test crop for new RegNet-V weights to match Y
2 years ago
Ross Wightman 1618527098 Add layer scale and parallel blocks to vision_transformer
2 years ago
Ross Wightman c42be74621 Add attrib / comments about Swin-S3 (AutoFormerV2) weights
2 years ago
Ross Wightman 474ac906a2 Add 'head norm first' convnext_tiny_hnf weights
2 years ago
Ross Wightman dc51334cdc Fix pruned adapt for EfficientNet models that are now using BatchNormAct layers
2 years ago
Ross Wightman 024fc4d9ab version 0.6.1 for master
2 years ago
Ross Wightman e1e037ba52 Fix bad tuple typing fix that was on XLA branch bust missed on master merge
2 years ago
Ross Wightman 341b464a5a Remove redundant noise attr from Plateau scheduler (use parent)
2 years ago
Ross Wightman fe457c1996 Update SwinTransformerV2Cr post-merge, update with grad checkpointing / grad matcher
2 years ago
Ross Wightman b049a5c5c6 Merge remote-tracking branch 'origin/master' into norm_norm_norm
2 years ago
Ross Wightman 7cdd164d77 Fix #1184, scheduler noise bug during merge madness
2 years ago
Ross Wightman 9440a50c95 Merge branch 'mrT23-master'
2 years ago
Ross Wightman d98aa47d12 Revert ml-decoder changes to model factory and train script
2 years ago
Ross Wightman b20665d379
Merge pull request #1007 from qwertyforce/patch-1
2 years ago
Ross Wightman 7a0994f581
Merge pull request #1150 from ChristophReich1996/master
2 years ago
Ross Wightman 61d3493f87 Fix hf-hub handling when hf-hub is config source
2 years ago
Ross Wightman 5f47518f27 Fix pit implementation to be clsoer to deit/levit re distillation head handling
2 years ago
Ross Wightman 0862e6ebae Fix correctness of some group matching regex (no impact on result), some formatting, missed forward_head for resnet
2 years ago
Ross Wightman 94bcdebd73 Add latest weights trained on TPU-v3 VM instances
2 years ago
Ross Wightman 0557c8257d Fix bug introduced in non layer_decay weight_decay application. Remove debug print, fix arg desc.
2 years ago
Ross Wightman 372ad5fa0d Significant model refactor and additions:
2 years ago