Ross Wightman
e2b8d44ff0
Halo, bottleneck attn, lambda layer additions and cleanup along w/ experimental model defs
* align interfaces of halo, bottleneck attn and lambda layer
* add qk_ratio to all of above, control q/k dim relative to output dim
* add experimental haloregnetz, and trionet (lambda + halo + bottle) models
3 years ago
data
Add worker_seeding arg to allow selecting old vs updated data loader worker seed for (old) experiment repeatability
3 years ago
loss
Update binary cross ent impl to use thresholding as an option (convert soft targets from mixup/cutmix to 0, 1)
3 years ago
models
Halo, bottleneck attn, lambda layer additions and cleanup along w/ experimental model defs
3 years ago
optim
Add lamb/lars to optim init imports, remove stray comment
3 years ago
scheduler
LR scheduler update:
3 years ago
utils
Tweak accuracy topk safety. Fix #807
3 years ago
__init__.py
Add default_cfg options for min_input_size / fixed_input_size, queries in model registry, and use for testing self-attn models
4 years ago
version.py
Version bump to 0.5.0 for pending release post RSB and ATTN updates
3 years ago