Commit Graph

108 Commits (ab16a358bb5ae7d7e2cdd78c90dcdd01d972963a)

Author SHA1 Message Date
Ross Wightman ab16a358bb Add log and continue handler for WDS errors, fix args.num_gpu for validation script fallback
3 years ago
Ross Wightman 7eeaf521a0 use gopen in wds to open info file in case it's at a url/gs location
3 years ago
Ross Wightman 229ac6b8d8 Fix alternate label handling in WDS parser to skip invalid alt labels
3 years ago
Ross Wightman a444d4b891 Add alternative label support to WDS for imagenet22k/12k split, add 21k/22k/12k indices filters to results/
3 years ago
Ross Wightman da2796ae82 Add webdataset (WDS) support, update TFDS to make some naming in parsers more similar. Fix workers=0 compatibility. Add ImageNet22k/12k synset defs.
3 years ago
Ross Wightman c639a86c67 Change TFDS default to full re-shuffle (init) each epoch (for now)
3 years ago
Ross Wightman a16ea1e355 Merge remote-tracking branch 'origin/norm_norm_norm' into bits_and_tpu
3 years ago
Ross Wightman 372ad5fa0d Significant model refactor and additions:
3 years ago
Ross Wightman 066e490605 Merge branch 'norm_norm_norm' into bits_and_tpu
3 years ago
Ross Wightman 95cfc9b3e8 Merge remote-tracking branch 'origin/master' into norm_norm_norm
3 years ago
Ross Wightman 656757d26b Fix MobileNetV2 head conv size for multiplier < 1.0. Add some missing modification copyrights, fix starting date of some old ones.
3 years ago
Ross Wightman 02ae11e526 Leaving repeat aug sampler indices as tensor thrashes worker shared process memory
3 years ago
Ross Wightman de5fa791c6 Merge branch 'master' into norm_norm_norm
3 years ago
Hyeongchan Kim a0b2657497
Use `torch.repeat_interleave()` to generate repeated indices faster (#1058)
3 years ago
Ross Wightman ff0f709c20 Testing TFDS shuffle across epochs
3 years ago
Ross Wightman cad170e494 Merge remote-tracking branch 'origin/norm_norm_norm' into bits_and_tpu
3 years ago
Ross Wightman 55adfbeb8d Add commented code to increase open file limit via Python (for TFDS dataset building)
3 years ago
Ross Wightman 809c7bb1ec Merge remote-tracking branch 'origin/master' into bits_and_tpu
3 years ago
Ross Wightman 78b36bf46c Places365 doesn't exist in some still used torchvision version
3 years ago
Ross Wightman cfa414cad2 Matching two bits_and_tpu changes for TFDs wrapper
3 years ago
Ross Wightman 4f338556d8 Fixes and improvements for metrics, tfds parser, loader / transform handling
3 years ago
Ross Wightman d9b0b3d60f device arg wasn't removed from PrefetcherCuda instantiation of RE
3 years ago
Ross Wightman 406c486ba2 Merge remote-tracking branch 'origin/more_datasets' into bits_and_tpu
3 years ago
Ross Wightman 9ec3210c2d More TFDS parser cleanup, support improved TFDS even_split impl (on tfds-nightly only currently).
3 years ago
Ross Wightman ba65dfe2c6 Dataset work
3 years ago
Ross Wightman a41de1f666 Add interpolation mode handling to transforms. Removes InterpolationMode warning. Works for torchvision versions w/ and w/o InterpolationMode enum. Fix #738.
3 years ago
Ross Wightman ed41d32637 Add repr to auto_augment and random_erasing impl
3 years ago
Ross Wightman 3b6ba76126 Merge remote-tracking branch 'origin/master' into bits_and_tpu
3 years ago
masafumi 047a5ec05f Fix bugs that Mixup does not work device=cpu
3 years ago
Ross Wightman 80075b0b8a Add worker_seeding arg to allow selecting old vs updated data loader worker seed for (old) experiment repeatability
3 years ago
Ross Wightman 25d52ea71d Merge remote-tracking branch 'origin/fixes_bce_regnet' into bits_and_tpu
3 years ago
Ross Wightman f8a63a3b71 Add worker_init_fn to loader for numpy seed per worker
3 years ago
Ross Wightman c2f02b08b8 Merge remote-tracking branch 'origin/attn_update' into bits_and_tpu
3 years ago
Ross Wightman fb94350896 Update training script and loader factory to allow use of scheduler updates, repeat augment, and bce loss
3 years ago
Ross Wightman f262137ff2 Add RepeatAugSampler as per DeiT RASampler impl, showing promise for current (distributed) training experiments.
3 years ago
Ross Wightman cb621e0f00 Remove print, arg order
3 years ago
Ross Wightman c06c739901 Merge branch 'master' into bits_and_tpu
3 years ago
Ross Wightman 40457e5691 Transforms, augmentation work for bits, add RandomErasing support for XLA (pushing into transforms), revamp of transform/preproc config, etc ongoing...
3 years ago
Ross Wightman 3cdaf5ed56 Add `mmax` config key to auto_augment for increasing upper bound of RandAugment magnitude beyond 10. Make AugMix uniform sampling default not override config setting.
3 years ago
Ross Wightman b57a03bd0d Merge branch 'master' into bits_and_tpu
4 years ago
Ross Wightman 91ab0b6ce5 Add proper TrainState checkpoint save/load. Some reorg/refactoring and other cleanup. More to go...
4 years ago
Ross Wightman 74d2829341 Merge branch 'master' into bits_and_tpu
4 years ago
Ross Wightman aa92d7b1c5 Major timm.bits update. Updater and DeviceEnv now dataclasses, after_step closure used, metrics base impl w/ distributed reduce, many tweaks/fixes.
4 years ago
Ross Wightman d53e91218e Fix tf.data options setting for newer TF versions
4 years ago
Ross Wightman 938716c753 Fix import issue, use devenv for dist info in parser_tfds
4 years ago
Ross Wightman 76de984a5f Fix some bugs with XLA support, logger, add hacky xla dist launch script since torch.dist.launch doesn't work
4 years ago
Ross Wightman 12d9a6d4d2 First timm.bits commit, add initial abstractions, WIP updates to train, val... some of it working
4 years ago
Norman Mu 79640fcc1f Enable uniform augmentation magnitude sampling and set AugMix default
4 years ago
Ross Wightman a5310a3451 Merge remote-tracking branch 'origin/benchmark-fixes-vit_hybrids' into pit_and_vit_update
4 years ago
Ross Wightman f42f1df26c Improve evenness of per-worker split for validation set with TFDS
4 years ago