ba2ca4b464One codepath for stdconv, switch layernorm to batchnorm so gain included. Tweak epsilon values for nfnet, resnetv2, vit hybrid.
Ross Wightman
2021-06-12 12:27:43 -0700
07fb05cc3dUpdate results csv files
Ross Wightman
2021-06-09 22:33:05 -0700
Global instead of local rank.
#693
SamuelGabriel
2021-06-09 19:11:58 +0200
b7a568f065Fix torchscript issue in bat
Ross Wightman
2021-06-08 23:19:51 -0700
d17b374f0fMinimum input_size needed to be higher
Ross Wightman
2021-06-08 21:31:39 -0700
b3b90d944dAdd min_input_size to bat_resnext to prevent test breakage.
Ross Wightman
2021-06-08 17:32:08 -0700
758c4438a7Update README.md
Ross Wightman
2021-06-08 15:19:11 -0700
d413eef1bfAdd ResMLP-24 model weights that I trained in PyTorch XLA on TPU-VM. 79.2 top-1.
Ross Wightman
2021-06-08 14:22:05 -0700
10d8fa4620Add gc and bat attention resnext26ts variants to byob for test.
Ross Wightman
2021-06-08 14:21:07 -0700
2f5ed2dec1Update `init_values` const for 24 and 36 layer ResMLP models
Ross Wightman
2021-06-07 17:15:04 -0700
8e4ac3549fAll ScaledStdConv and StdConv uses default to using F.layernorm so that they work with PyTorch XLA. eps value tweaking is a WIP.
Ross Wightman
2021-06-07 17:14:19 -0700
2a63d0246bPost merge cleanup
Ross Wightman
2021-06-07 14:38:30 -0700
Update README.md
Ross Wightman
2021-06-04 14:23:34 -0700
6b2d9c2660Another bits/README.md update
Ross Wightman
2021-06-04 13:08:29 -0700
c3db5f5801Worker hack for TFDS eval, add TPU env var setting.
Ross Wightman
2021-06-04 13:00:52 -0700
f411724de4Fix checkpoint delete issue. Add README about bits and initial Pytorch XLA usage on TPU-VM. Add some FIXMEs and fold train_cfg into train_state by default.
Ross Wightman
2021-06-04 12:49:53 -0700
b57a03bd0dMerge branch 'master' into bits_and_tpu
Ross Wightman
2021-06-03 17:51:20 -0700
91ab0b6ce5Add proper TrainState checkpoint save/load. Some reorg/refactoring and other cleanup. More to go...
Ross Wightman
2021-06-03 17:49:40 -0700
bda8ab015aRemove min channels for SelectiveKernel, divisor should cover cases well enough.
Ross Wightman
2021-05-31 15:38:56 -0700
a27f4aec4aMissed args for skresnext w/ refactoring.
Ross Wightman
2021-05-31 14:06:34 -0700
307a935b79Add non-local and BAT attention. Merge attn and self-attn factories into one. Add attention references to README. Add mlp 'mode' to ECA.
Ross Wightman
2021-05-31 13:18:11 -0700
34522097b1See if we can use tcmalloc in test runner
Ross Wightman
2021-05-30 21:12:10 -0700
8bf63b6c6cAble to use other attn layer in EfficientNet now. Create test ECA + GC B0 configs. Make ECA more configurable.
Ross Wightman
2021-05-30 12:47:02 -0700
8d9ebe3788fix some bugs
lixinjie
2021-05-31 00:36:13 +0800
bd996241f6fix some bugs
lixinjie
2021-05-30 21:25:28 +0800
bcec14d3b5Bring EfficientNet SE layer in line with others, pull se_ratio outside of blocks. Allows swapping w/ other attn layers.
Ross Wightman
2021-05-29 23:41:38 -0700
041a85fe7bfix some bugs to support segmentation task
lixinjie
2021-05-30 12:55:21 +0800
Merge branch 'master' into master
abcdvzz
2021-05-29 21:26:53 +0800
9611458e19Throw in some FBNetV3 code I had lying around, some refactoring of SE reduction channel calcs for all EffNet archs.
Ross Wightman
2021-05-28 20:47:24 -0700
01b9108619Merge branch 'master' into more_attn
Ross Wightman
2021-05-28 11:09:37 -0700
d7bab8a6c5Fix strict flag change for checkpoint load.
Ross Wightman
2021-05-28 09:54:50 -0700
02f9d4bc34Add weights for resnet51q model, add 61q def.
Ross Wightman
2021-05-28 09:53:16 -0700
c0c03c4e97fix args name bug
#669
John Krafcik
2021-05-28 11:39:44 +0800
f615474be3Fix broken test, repvgg block doesn't have attn_last attr.
Ross Wightman
2021-05-27 18:12:22 -0700
742c2d5247Add Gather-Excite and Global Context attn modules. Refactor existing SE-like attn for consistency and refactor byob/byoanet for less redundancy.
Ross Wightman
2021-05-27 18:03:29 -0700
9c78de8c02Fix#661, move hardswish out of default args for LeViT. Enable native torch support for hardswish, hardsigmoid, mish if present.
Ross Wightman
2021-05-26 15:28:42 -0700
51c432150aREADME: fix simple typos
#660
Peter Vandenabeele
2021-05-25 22:42:44 +0200
fd92ba0de8Filter large vit models from torchscript tests
Ross Wightman
2021-05-25 12:52:07 -0700
318360c3f9Update README.md before merge. Bump version to 0.4.10
Ross Wightman
2021-05-25 12:25:53 -0700
99d97e0d67Hopefully the last test update for this PR...
Ross Wightman
2021-05-25 11:10:17 -0700
11ae795e99Redo LeViT attention bias caching in a way that works with both torchscript and DataParallel
Ross Wightman
2021-05-25 10:15:32 -0700
d400f1dbddFilter test models before creation for backward/torchscript tests
Ross Wightman
2021-05-25 10:14:45 -0700
43180b1341update twins.py to support segmentation task
李鑫杰
2021-05-25 18:07:50 +0800
c4572cc5aaAdd Visformer-small weighs, tweak torchscript jit test img size.
Ross Wightman
2021-05-24 22:50:12 -0700
83487e2a0dLower max backward size for tests.
Ross Wightman
2021-05-24 21:36:56 -0700
bfc72f75d3Expand scope of testing for non-std vision transformer / mlp models. Some related cleanup and create fn cleanup for all vision transformer and mlp models. More CoaT weights.
Ross Wightman
2021-05-24 21:13:26 -0700
18bf520ad1Add eca_nfnet_l2/l3 defs for future training
Ross Wightman
2021-05-22 21:55:37 -0700
f45de37690Merge branch 'master' into levit_visformer_rednet
Ross Wightman
2021-05-22 16:34:31 -0700
23c18a33e4Add efficientnetv2_rw_m weights trained in PyTorch. 84.8 top-1 @ 416 test. 53M params.
Ross Wightman
2021-05-21 21:16:25 -0700
5b9c69e80aAdd basic training resume based on legacy code
Ross Wightman
2021-05-21 18:08:06 -0700
c2ba229d99Prep for effcientnetv2_rw_m model weights that started training before official release..
Ross Wightman
2021-05-21 17:47:49 -0700
22f7c6760fMerge branch 'alexander-soare-master'
Ross Wightman
2021-05-21 17:21:40 -0700
30b9880d06Minor adjustment, mutable default arg, extra check of valid len...
Ross Wightman
2021-05-21 17:20:33 -0700