added training script note
#1420
Florian
2022-08-19 14:33:36 +0200
f332fc2db7Fix some test failures, torchscript issues
Ross Wightman
2022-08-18 16:19:46 -0700
6e559e9b5fAdd MViT (Multi-Scale) V2
Ross Wightman
2022-08-17 15:12:31 -0700
43aa84e861Add 'fast' layer norm that doesn't cast to float32, support APEX LN impl for slight speed gain, update norm and act factories, tweak SE for ability to disable bias (needed by GCVit)
Ross Wightman
2022-08-17 14:32:58 -0700
c486aa71f8Add GCViT
Ross Wightman
2022-08-17 14:29:18 -0700
fba6ecd39bAdd EfficientFormer
Ross Wightman
2022-08-17 14:08:53 -0700
ff4a38e2c3Add PyramidVisionTransformerV2
Ross Wightman
2022-08-17 12:06:05 -0700
5a40c6a3c4Fix issue with torchvision's ImageNet
Edoardo Debenedetti
2022-08-17 15:10:41 +0200
1d8ada359aAdd timm ConvNeXt 'atto' weights, change test resolution for FB ConvNeXt 224x224 weights, add support for different dw kernel_size
v0.1-weights-morevit
Ross Wightman
2022-08-15 17:56:08 -0700
7c4682dc08Update README.md
Ross Wightman
2022-08-05 17:28:56 -0700
2544d3b80fConvNeXt pico, femto, and nano, pico, femto ols (overlapping stem) weights and model defs
Ross Wightman
2022-08-05 17:05:50 -0700
13565aad50Add edgenext_base model def & weight link, update to improve ONNX export #1385
Ross Wightman
2022-08-05 16:58:34 -0700
56596e4e84jit trace comparisons snuck into torchscript part of validate.py, fixed
Ross Wightman
2022-07-31 21:13:56 -0700
Update type hint for `register_notrace_module`
#1363
Jasha10
2022-07-22 16:59:55 -0500
d7b55a9429Add gmacs and macts columns to inference benchmark (missed profile in initial run)
Ross Wightman
2022-07-22 12:12:37 -0700
1b278136c3Change models with mean 0,0,0 std 1,1,1 from int to float for consistency as mentioned in #1355
Ross Wightman
2022-07-21 17:36:15 -0700
909705e7ffRemove some redundant requires_grad=True from nn.Parameter in third party code
Ross Wightman
2022-07-20 12:37:41 -0700
c5e0d1c700Add dilation support to convnext, allows output_stride=8 and 16 use. Fix#1341
Ross Wightman
2022-07-19 17:52:10 -0700
5e7d47ca10Add pytorch 1.12 benchmark csv files w/ 0.6.6 code. Remove pytorch 1.10 results. Deciding whether to update 1.11 results or remove...
Ross Wightman
2022-07-19 15:29:38 -0700
dc376e3676Ensure all model entrypoint fn default to `pretrained=False` (a few didn't)
Ross Wightman
2022-07-19 13:58:41 -0700
23b102064aAdd cs3sedarknet_x weights w/ 82.65 @ 288 top1. Add 2 cs3 edgenet models (w/ 3x3-1x1 block), remove aa from cspnet blocks (not needed)
Ross Wightman
2022-07-19 13:21:20 -0700
0dbd9352ceAdd bulk_runner script and updates to benchmark.py and validate.py for better error handling in bulk runs (used for benchmark and validation result runs). Improved batch size decay stepping on retry...
Ross Wightman
2022-07-18 18:01:39 -0700
Merge pull request #1354 from rwightman/fix_tests
Ross Wightman
2022-07-17 21:37:16 -0700
29afe79c8bAttempt to fix unit tests by removing subset of tests on mac runner
#1354
fix_tests
Ross Wightman
2022-07-17 14:55:47 -0700
326ade2999Add updated validation / test set results, benchmarks still running...
Ross Wightman
2022-07-16 16:15:28 -0700
92b91af3bbversion 0.6.6
Ross Wightman
2022-07-15 17:00:18 -0700
05313940e2Add cs3darknet_x, cs3sedarknet_l, and darknetaa53 weights from TPU sessions. Move SE btwn conv1 & conv2 in DarkBlock. Improve SE/attn handling in Csp/DarkNet. Fix leaky_relu bug on older csp models.
Ross Wightman
2022-07-15 16:53:51 -0700
a1cb25066eAdd edgnext_small_rw weights trained with swin like recipe. Better than original 'small' but not the recent 'USI' distilled weights.
Ross Wightman
2022-07-07 22:02:57 -0700
7c7ecd2492Add --use-train-size flag to force use of train input_size (over test input size) for validation. Default test-time pooling to use train input size (fixes issues).
Ross Wightman
2022-07-07 22:01:24 -0700
ce65a7b29fUpdate vit_relpos w/ some additional weights, some cleanup to match recent vit updates, more MLP log coord experiments.
Ross Wightman
2022-07-07 21:33:25 -0700
58621723bdAdd CrossStage3 DarkNet (cs3) weights
Ross Wightman
2022-07-07 17:43:38 -0700
9be0c84715Change set -> dict w/ None keys for dataset split synonym search, so always consistent if more than 1 exists. Fix#1224
Ross Wightman
2022-07-07 15:33:53 -0700
4670d375c6Reorg benchmark.py import
Ross Wightman
2022-07-07 15:21:29 -0700
Merge pull request #1336 from xwang233/add-local-rank
Ross Wightman
2022-07-07 15:20:30 -0700
500c190860Add --aot-autograd (functorch efficient mem fusion) support to validate.py
Ross Wightman
2022-07-07 15:15:25 -0700
28e0152043Add --no-retry flag to benchmark.py to skip batch_size decay and retry on error. Fix#1226. Update deepspeed profile usage for latest DS releases. Fix # 1333
Ross Wightman
2022-07-07 15:13:06 -0700
11060f84c5make train.py compatible with torchrun
#1336
Xiao Wang
2022-07-07 14:44:55 -0700
db0cee9910Refactor cspnet configuration using dataclasses, update feature extraction for new cs3 variants.
Ross Wightman
2022-07-07 14:43:27 -0700
eca09b8642Add MobileVitV2 support. Fix#1332. Move GroupNorm1 to common layers (used in poolformer + mobilevitv2). Keep ol custom ConvNeXt LayerNorm2d impl as LayerNormExp2d for reference.
Ross Wightman
2022-07-07 14:41:01 -0700
06307b8b41Remove experimental downsample in block support in ConvNeXt. Experiment further before keeping it in.
Ross Wightman
2022-07-07 14:37:58 -0700
bfc0dccb0eImprove image extension handling, add methods to modify / get defaults. Fix#1335fix#1274.
Ross Wightman
2022-07-07 14:23:20 -0700
7d4b3807d5Support DeiT-3 (Revenge of the ViT) checkpoints. Add non-overlapping (w/ class token) pos-embed support to vit.
Ross Wightman
2022-07-04 22:25:22 -0700
export.py added to export timm models to onnx
swenkel
2022-07-03 21:44:47 +0300
d0c5bd5722Rename cs2->cs3 for darknets. Fix features_only for cs3 darknets.
Ross Wightman
2022-07-03 08:32:41 -0700
d765305821Remove first_conv for resnetaa50 def
Ross Wightman
2022-07-02 15:56:17 -0700
dd9b8f57c4Add feature_info to edgenext for features_only support, hopefully fix some fx / test errors
Ross Wightman
2022-07-02 15:20:45 -0700
377e9bfa21Add TPU trained darknet53 weights. Add mising pretrain_cfg for some csp/darknet models.
Ross Wightman
2022-07-02 15:18:52 -0700
c170ba3173Add weights for resnet10t, resnet14t, and resnetaa50 models. Fix#1314
Ross Wightman
2022-07-02 15:18:06 -0700
188c194b0fLeft some experiment stem code in convnext by mistake
Ross Wightman
2022-07-02 15:17:28 -0700
70d6d2c484support test_crop_size in data config resolve
Ross Wightman
2022-07-02 15:17:05 -0700
6064d16a2dAdd initial EdgeNeXt import. Significant cleanup / reorg (like ConvNeXt). Fix#1320 * edgenext refactored for torchscript compat, stage base organization * slight refactor of ConvNeXt to match some EdgeNeXt additions * remove use of funky LayerNorm layer in ConvNeXt and just use nn.LayerNorm and LayerNorm2d (permute)
Ross Wightman
2022-07-01 15:16:41 -0700
7a9c6811c9Add eps arg to LayerNorm2d, add 'tf' (tensorflow) variant of trunc_normal_ that applies scale/shift after sampling (instead of needing to move a/b)
Ross Wightman
2022-07-01 15:15:39 -0700
82c311d082Add more experimental darknet and 'cs2' darknet variants (different cross stage setup, closer to newer YOLO backbones) for train trials.
Ross Wightman
2022-07-01 15:14:01 -0700
a050fde5cdAdd resnet10t (basic block) and resnet14t (bottleneck) with 1,1,1,1 repeats
Ross Wightman
2022-07-01 15:03:28 -0700
34f382f8f6move dataconfig before script, scripting killing metadata now (PyTorch 1.12? just nvfuser?)
Ross Wightman
2022-07-01 14:50:36 -0700