pytorch-image-models

Commit Graph

Author	SHA1	Message	Date
Ross Wightman	c16e965037	Add some ViT comments and fix a few minor issues.	3 years ago
Ross Wightman	55f7dfa9ea	Refactor vision_transformer entrpy fns, add pos embedding resize support for fine tuning, add some deit models for testing	3 years ago
Ross Wightman	855d6cc217	More dataset work including factories and a tensorflow datasets (TFDS) wrapper * Add parser/dataset factory methods for more flexible dataset & parser creation * Add dataset parser that wraps TFDS image classification datasets * Tweak num_classes handling bug for 21k models * Add initial deit models so they can be benchmarked in next csv results runs	4 years ago
Ross Wightman	ce69de70d3	Add 21k weight urls to vision_transformer. Cleanup feature_info for preact ResNetV2 (BiT) models	4 years ago
Ross Wightman	231d04e91a	ResNetV2 pre-act and non-preact model, w/ BiT pretrained weights and support for ViT R50 model. Tweaks for in21k num_classes passing. More to do... tests failing.	4 years ago
Ross Wightman	b401952caf	Add newly added vision transformer large/base 224x224 weights ported from JAX official repo	4 years ago
Ross Wightman	61200db0ab	in_chans=1 working w/ pretrained weights for vision_transformer	4 years ago
Ross Wightman	f591e90b0d	Make sure num_features attr is present in vit models as with others	4 years ago
Ross Wightman	f944242cb0	Fix #262 , num_classes arg mixup. Make vision_transformers a bit closer to other models wrt get/reset classfier/forward_features. Fix torchscript for ViT.	4 years ago
Ross Wightman	736f209e7d	Update vision transformers to be compatible with official code. Port official ViT weights from jax impl.	4 years ago
Ross Wightman	27a93e9de7	Improve test crop for ViT models. Small now 77.85, added base weights at 79.35 top-1.	4 years ago
Ross Wightman	d4db9e7977	Add small vision transformer weights. 77.42 top-1.	4 years ago
Ross Wightman	f31933cb37	Initial Vision Transformer impl w/ patch and hybrid variants. Refactor tuple helpers.	4 years ago

13 Commits (c16e9650371d167dcb38669aa1280caba2c69dcd)