pytorch-image-models

Commit Graph

Author	SHA1	Message	Date
Ross Wightman	444dcba4ad	CLIP B16 12k weights added	2 years ago
Ross Wightman	dff4717cbf	Add clip b16 384x384 finetunes	2 years ago
Ross Wightman	883fa2eeaa	Add fine-tuned B/16 224x224 in1k clip models	2 years ago
Ross Wightman	9a3d2ac2d5	Add latest CLIP ViT fine-tune pretrained configs / model entrypt updates	2 years ago
Ross Wightman	def68befa7	Updating vit model defs for mult-weight support trial (vit first). Prepping for CLIP (laion2b and openai) fine-tuned weights.	2 years ago
Ross Wightman	0dadb4a6e9	Initial multi-weight support, handled so old pretraing config handling co-exists with new tags.	2 years ago
Mohamed Rashad	8fda68aff6	Fix repo id bug This to fix this issue #1482	2 years ago
Ross Wightman	1199c5a1a4	clip_laion2b models need 1e-5 eps for LayerNorm	2 years ago
Ross Wightman	e069249a2d	Add hf hub entries for laion2b clip models, add huggingface_hub dependency, update some setup/reqs, torch >= 1.7	2 years ago
Ross Wightman	9d65557be3	Fix errant import	2 years ago
Ross Wightman	9709dbaaa9	Adding support for fine-tune CLIP LAION-2B image tower weights for B/32, L/14, H/14 and g/14. Still WIP	2 years ago
Ross Wightman	e11efa872d	Update a bunch of weights with external links to timm release assets. Fixes issue with *aliyuncs.com returning forbidden. Did pickle scan / verify and re-hash. Add TresNet-V2-L weights.	2 years ago
Ceshine Lee	0b64117592	Take `no_emb_class` into account when calling `resize_pos_embed`	2 years ago
Ross Wightman	1b278136c3	Change models with mean 0,0,0 std 1,1,1 from int to float for consistency as mentioned in #1355	2 years ago
Ross Wightman	a8e34051c1	Unbreak gamma remap impacting beit checkpoint load, version bump to 0.6.4	2 years ago
Ross Wightman	7d4b3807d5	Support DeiT-3 (Revenge of the ViT) checkpoints. Add non-overlapping (w/ class token) pos-embed support to vit.	2 years ago
Ross Wightman	7d657d2ef4	Improve resolve_pretrained_cfg behaviour when no cfg exists, warn instead of crash. Improve usability ex #1311	2 years ago
Ross Wightman	f5ca4141f7	Adjust arg order for recent vit model args, add a few comments	3 years ago
Ross Wightman	41dc49a337	Vision Transformer refactoring and Rel Pos impl	3 years ago
Ross Wightman	1618527098	Add layer scale and parallel blocks to vision_transformer	3 years ago
Ross Wightman	0862e6ebae	Fix correctness of some group matching regex (no impact on result), some formatting, missed forward_head for resnet	3 years ago
Ross Wightman	372ad5fa0d	Significant model refactor and additions: * All models updated with revised foward_features / forward_head interface * Vision transformer and MLP based models consistently output sequence from forward_features (pooling or token selection considered part of 'head') * WIP param grouping interface to allow consistent grouping of parameters for layer-wise decay across all model types * Add gradient checkpointing support to a significant % of models, especially popular architectures * Formatting and interface consistency improvements across models * layer-wise LR decay impl part of optimizer factory w/ scale support in scheduler * Poolformer and Volo architectures added	3 years ago
Ross Wightman	5f81d4de23	Move DeiT to own file, vit getting crowded. Working towards fixing #1029 , make pooling interface for transformers and mlp closer to convnets. Still working through some details...	3 years ago
Ross Wightman	95cfc9b3e8	Merge remote-tracking branch 'origin/master' into norm_norm_norm	3 years ago
Ross Wightman	abc9ba2544	Transitioning default_cfg -> pretrained_cfg. Improving handling of pretrained_cfg source (HF-Hub, files, timm config, etc). Checkpoint handling tweaks.	3 years ago
Ross Wightman	07379c6d5d	Add vit_base2_patch32_256 for a model between base_patch16 and patch32 with a slightly larger img size and width	3 years ago
Ross Wightman	010b486590	Add Dino pretrained weights (no head) for vit models. Add support to tests and helpers for models w/ no classifier (num_classes=0 in pretrained cfg)	3 years ago
Ross Wightman	e967c72875	Update REAMDE.md. Sneak in g/G (giant / gigantic?) ViT defs from scaling paper	3 years ago
Ross Wightman	656757d26b	Fix MobileNetV2 head conv size for multiplier < 1.0. Add some missing modification copyrights, fix starting date of some old ones.	3 years ago
Martins Bruveris	5220711d87	Added B/8 models to ViT.	3 years ago
Thomas Viehmann	f805ba86d9	use .unbind instead of explicitly listing the indices	3 years ago
Ross Wightman	78933122c9	Fix silly typo	3 years ago
Ross Wightman	708d87a813	Fix ViT SAM weight compat as weights at URL changed to not use repr layer. Fix #825 . Tweak optim test.	3 years ago
Ying Jin	20b2d4b69d	Use bicubic interpolation in resize_pos_embed()	3 years ago
Ross Wightman	6d8272e92c	Add SAM pretrained model defs/weights for ViT B16 and B32 models.	3 years ago
Ross Wightman	85f894e03d	Fix ViT in21k representation (pre_logits) layer handling across old and new npz checkpoints	3 years ago
Ross Wightman	b41cffaa93	Fix a few issues loading pretrained vit/bit npz weights w/ num_classes=0 __init__ arg. Missed a few other small classifier handling detail on Mlp, GhostNet, Levit. Should fix #713	3 years ago
Ross Wightman	9c9755a808	AugReg release	3 years ago
Ross Wightman	b319eb5b5d	Update ViT weights, more details to be added before merge.	3 years ago
Ross Wightman	b9cfb64412	Support npz custom load for vision transformer hybrid models. Add posembed rescale for npz load.	3 years ago
Ross Wightman	8880f696b6	Refactoring, cleanup, improved test coverage. * Add eca_nfnet_l2 weights, 84.7 @ 384x384 * All 'non-std' (ie transformer / mlp) models have classifier / default_cfg test added * Fix #694 reset_classifer / num_features / forward_features / num_classes=0 consistency for transformer / mlp models * Add direct loading of npz to vision transformer (pure transformer so far, hybrid to come) * Rename vit_deit* to deit_* * Remove some deprecated vit hybrid model defs * Clean up classifier flatten for conv classifiers and unusual cases (mobilenetv3/ghostnet) * Remove explicit model fns for levit conv, just pass in arg	3 years ago
Ross Wightman	bfc72f75d3	Expand scope of testing for non-std vision transformer / mlp models. Some related cleanup and create fn cleanup for all vision transformer and mlp models. More CoaT weights.	4 years ago
Ross Wightman	30b9880d06	Minor adjustment, mutable default arg, extra check of valid len...	4 years ago
Alexander Soare	8086943b6f	allow resize positional embeddings to non-square grid	4 years ago
Ross Wightman	b2c305c2aa	Move Mlp and PatchEmbed modules into layers. Being used in lots of models now...	4 years ago
Ross Wightman	a0492e3b48	A few miil weights naming tweaks to improve compat with model registry and filtering wildcards.	4 years ago
talrid	19e1b67a84	old spaces	4 years ago
talrid	a443865876	update naming and scores	4 years ago
talrid	cf0e371594	84_0	4 years ago
talrid	0968bdeca3	vit, tresnet and mobilenetV3 ImageNet-21K-P weights	4 years ago

1 2

82 Commits (444dcba4ad71b59e91c7f448854bfbf1dd1a3b14)