Ross Wightman
|
837c68263b
|
For ConvNeXt, use timm internal LayerNorm for fast_norm in non conv_mlp mode
|
2 years ago |
Ross Wightman
|
cac0a4570a
|
More test fixes, pool size for 256x256 maxvit models
|
2 years ago |
Ross Wightman
|
e939ed19b9
|
Rename internal creation fn for maxvit, has not been just coatnet for a while...
|
2 years ago |
Ross Wightman
|
ffaf97f813
|
MaxxVit! A very configurable MaxVit and CoAtNet impl with lots of goodies..
|
2 years ago |
Ross Wightman
|
8c9696c9df
|
More model and test fixes
|
2 years ago |
Ross Wightman
|
ca52108c2b
|
Fix some model support functions
|
2 years ago |
Ross Wightman
|
f332fc2db7
|
Fix some test failures, torchscript issues
|
2 years ago |
Ross Wightman
|
6e559e9b5f
|
Add MViT (Multi-Scale) V2
|
2 years ago |
Ross Wightman
|
43aa84e861
|
Add 'fast' layer norm that doesn't cast to float32, support APEX LN impl for slight speed gain, update norm and act factories, tweak SE for ability to disable bias (needed by GCVit)
|
2 years ago |
Ross Wightman
|
c486aa71f8
|
Add GCViT
|
2 years ago |
Ross Wightman
|
fba6ecd39b
|
Add EfficientFormer
|
2 years ago |
Ross Wightman
|
ff4a38e2c3
|
Add PyramidVisionTransformerV2
|
2 years ago |
Ross Wightman
|
1d8ada359a
|
Add timm ConvNeXt 'atto' weights, change test resolution for FB ConvNeXt 224x224 weights, add support for different dw kernel_size
|
2 years ago |
Ross Wightman
|
7c4682dc08
|
Update README.md
|
2 years ago |
Ross Wightman
|
2544d3b80f
|
ConvNeXt pico, femto, and nano, pico, femto ols (overlapping stem) weights and model defs
|
2 years ago |
Ross Wightman
|
13565aad50
|
Add edgenext_base model def & weight link, update to improve ONNX export #1385
|
2 years ago |
Ross Wightman
|
56596e4e84
|
jit trace comparisons snuck into torchscript part of validate.py, fixed
|
2 years ago |
Ross Wightman
|
8ad4bdfa06
|
Allow ntuple to be used with string values
|
2 years ago |
Christoph Reich
|
faae93e62d
|
Fix typo in PositionalEncodingFourier
|
2 years ago |
Ross Wightman
|
7430a85d07
|
Update README, bump version to 0.6.8
|
2 years ago |
Ross Wightman
|
ec6a28830f
|
Add DeiT-III 'medium' model defs and weights
|
2 years ago |
Ross Wightman
|
7cd4204a28
|
Add TPU TRC acknowledge
|
2 years ago |
Ross Wightman
|
7d44d65bf5
|
Update README and changelogs
|
2 years ago |
Ross Wightman
|
d875a1d3f6
|
version 0.6.7
|
2 years ago |
Ross Wightman
|
c865028c34
|
Update benchmark with latest model adds
|
2 years ago |
Ross Wightman
|
30bd1746c5
|
Improve csv table result processing for better sort when updating
|
2 years ago |
Ross Wightman
|
e987e29036
|
Add convnext_nano and few cs3 models to existing results tables
|
2 years ago |
Ross Wightman
|
6f103a442b
|
Add convnext_nano weights, 80.8 @ 224, 81.5 @ 288
|
2 years ago |
Ross Wightman
|
4042a94f8f
|
Add weights for two 'Edge' block (3x3->1x1) variants of CS3 networks.
|
2 years ago |
Ross Wightman
|
c8f69e04a9
|
Merge pull request #1365 from veritable-tech/fix-resize-pos-embed
Take `no_emb_class` into account when calling `resize_pos_embed`
|
2 years ago |
Ross Wightman
|
99af63ca92
|
Merge pull request #1277 from lukasugar/patch-1
Add missing output in Feature extraction docs
|
2 years ago |
Ross Wightman
|
45c447fc15
|
Merge pull request #1363 from Jasha10/patch-1
Update type hint for `register_notrace_module`
|
2 years ago |
Ceshine Lee
|
0b64117592
|
Take `no_emb_class` into account when calling `resize_pos_embed`
|
2 years ago |
Jasha10
|
56c3a84db3
|
Update type hint for `register_notrace_module`
register_notrace_module is used to decorate types (i.e. subclasses of nn.Module).
It is not called on module instances.
|
2 years ago |
Ross Wightman
|
d7b55a9429
|
Add gmacs and macts columns to inference benchmark (missed profile in initial run)
|
2 years ago |
Ross Wightman
|
1b278136c3
|
Change models with mean 0,0,0 std 1,1,1 from int to float for consistency as mentioned in #1355
|
2 years ago |
Ross Wightman
|
909705e7ff
|
Remove some redundant requires_grad=True from nn.Parameter in third party code
|
2 years ago |
Ross Wightman
|
c5e0d1c700
|
Add dilation support to convnext, allows output_stride=8 and 16 use. Fix #1341
|
2 years ago |
Ross Wightman
|
5e7d47ca10
|
Add pytorch 1.12 benchmark csv files w/ 0.6.6 code. Remove pytorch 1.10 results. Deciding whether to update 1.11 results or remove...
|
2 years ago |
Ross Wightman
|
dc376e3676
|
Ensure all model entrypoint fn default to `pretrained=False` (a few didn't)
|
2 years ago |
Ross Wightman
|
23b102064a
|
Add cs3sedarknet_x weights w/ 82.65 @ 288 top1. Add 2 cs3 edgenet models (w/ 3x3-1x1 block), remove aa from cspnet blocks (not needed)
|
2 years ago |
Ross Wightman
|
0dbd9352ce
|
Add bulk_runner script and updates to benchmark.py and validate.py for better error handling in bulk runs (used for benchmark and validation result runs). Improved batch size decay stepping on retry...
|
2 years ago |
Ross Wightman
|
4547920f85
|
Merge pull request #1354 from rwightman/fix_tests
Attempting to fix unit test failures...
|
2 years ago |
Ross Wightman
|
29afe79c8b
|
Attempt to fix unit tests by removing subset of tests on mac runner
|
2 years ago |
Ross Wightman
|
326ade2999
|
Add updated validation / test set results, benchmarks still running...
|
2 years ago |
Ross Wightman
|
92b91af3bb
|
version 0.6.6
|
2 years ago |
Ross Wightman
|
05313940e2
|
Add cs3darknet_x, cs3sedarknet_l, and darknetaa53 weights from TPU sessions. Move SE btwn conv1 & conv2 in DarkBlock. Improve SE/attn handling in Csp/DarkNet. Fix leaky_relu bug on older csp models.
|
2 years ago |
Ross Wightman
|
4283c0c478
|
Merge pull request #1351 from nateraw/use-hf-hub-download
Use hf_hub_download instead of cached_download
|
2 years ago |
nateraw
|
51cca82aa1
|
👽 use hf_hub_download instead of cached_download
|
2 years ago |
Ross Wightman
|
324a4e58b6
|
disable nvfuser for jit te/legacy modes (for PT 1.12+)
|
2 years ago |