pytorch-image-models/timm/scheduler/scheduler_factory.py

""" Scheduler Factory
Hacked together by / Copyright 2021 Ross Wightman
"""
from .cosine_lr import CosineLRScheduler
from .multistep_lr import MultiStepLRScheduler
from .plateau_lr import PlateauLRScheduler
from .poly_lr import PolyLRScheduler
from .step_lr import StepLRScheduler
from .tanh_lr import TanhLRScheduler


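def create_scheduler(args, optimizer):
    """Build an LR scheduler from parsed training args.

    Returns a ``(lr_scheduler, num_epochs)`` tuple. ``lr_scheduler`` is None
    when ``args.sched`` matches no known schedule. For 'cosine', 'tanh', and
    'poly', ``num_epochs`` is the full cycle length plus ``args.cooldown_epochs``;
    otherwise it is ``args.epochs``.
    """
    num_epochs = args.epochs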

    # lr_noise holds fraction(s) of the total epochs; scale to epoch units.
    if getattr(args, 'lr_noise', None) is not None:
        lr_noise = getattr(args, 'lr_noise')
        if isinstance(lr_noise, (list, tuple)):
            noise_range = [n * num_epochs for n in lr_noise]
            if len(noise_range) == 1:
                noise_range = noise_range[0]
        else:
            noise_range = lr_noise * num_epochs
    else:
        noise_range = None
    noise_args = dict(
        noise_range_t=noise_range,
        noise_pct=getattr(args, 'lr_noise_pct', 0.67),
        noise_std=getattr(args, 'lr_noise_std', 1.),
        noise_seed=getattr(args, 'seed', 42),
    )
    cycle_args = dict(
        cycle_mul=getattr(args, 'lr_cycle_mul', 1.),
        cycle_decay=getattr(args, 'lr_cycle_decay', 0.1),
        cycle_limit=getattr(args, 'lr_cycle_limit', 1),
    )

    lr_scheduler = None
    if args.sched == 'cosine':
        lr_scheduler = CosineLRScheduler(
            optimizer,
            t_initial=num_epochs,
            lr_min=args.min_lr,
            warmup_lr_init=args.warmup_lr,
            warmup_t=args.warmup_epochs,
            k_decay=getattr(args, 'lr_k_decay', 1.0),
            **cycle_args,
            **noise_args,
        )
        num_epochs = lr_scheduler.get_cycle_length() + args.cooldown_epochs
    elif args.sched == 'tanh':
        lr_scheduler = TanhLRScheduler(
            optimizer,
            t_initial=num_epochs,
            lr_min=args.min_lr,
            warmup_lr_init=args.warmup_lr,
            warmup_t=args.warmup_epochs,
            t_in_epochs=True,
            **cycle_args,
            **noise_args,
        )
        num_epochs = lr_scheduler.get_cycle_length() + args.cooldown_epochs
    elif args.sched == 'step':
        lr_scheduler = StepLRScheduler(
            optimizer,
            decay_t=args.decay_epochs,
            decay_rate=args.decay_rate,
            warmup_lr_init=args.warmup_lr,
            warmup_t=args.warmup_epochs,
            **noise_args,
        )
    elif args.sched == 'multistep':
        lr_scheduler = MultiStepLRScheduler(
            optimizer,
            decay_t=args.milestones,
            decay_rate=args.decay_rate,
            warmup_lr_init=args.warmup_lr,
            warmup_t=args.warmup_epochs,
            **noise_args,
        )
    elif args.sched == 'plateau':
        mode = 'min' if 'loss' in getattr(args, 'eval_metric', '') else 'max'
        lr_scheduler = PlateauLRScheduler(
            optimizer,
            decay_rate=args.decay_rate,
            patience_t=args.patience_epochs,
            lr_min=args.min_lr,
            mode=mode,
            warmup_lr_init=args.warmup_lr,
            warmup_t=args.warmup_epochs,
            cooldown_t=0,
            **noise_args,
        )
    elif args.sched == 'poly':
        lr_scheduler = PolyLRScheduler(
            optimizer,
            power=args.decay_rate,  # overloading 'decay_rate' as polynomial power
            t_initial=num_epochs,
            lr_min=args.min_lr,
            warmup_lr_init=args.warmup_lr,
            warmup_t=args.warmup_epochs,
            k_decay=getattr(args, 'lr_k_decay', 1.0),
            **cycle_args,
            **noise_args,
        )
        num_epochs = lr_scheduler.get_cycle_length() + args.cooldown_epochs

    return lr_scheduler, num_epochs
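
The two pieces of argument handling in `create_scheduler` that are easiest to misread are the `lr_noise` scaling and the `plateau` mode selection. The sketch below restates both as standalone functions so they can be checked without an optimizer or timm installed. The helper names `resolve_noise_range` and `resolve_plateau_mode` are hypothetical and not part of the module; the logic is copied from the function above, and `argparse.Namespace` stands in for the parsed training args.

```python
from argparse import Namespace


def resolve_noise_range(args, num_epochs):
    """Hypothetical helper mirroring the lr_noise handling in create_scheduler."""
    lr_noise = getattr(args, 'lr_noise', None)
    if lr_noise is None:
        return None
    if isinstance(lr_noise, (list, tuple)):
        # Each entry is a fraction of the total epochs.
        noise_range = [n * num_epochs for n in lr_noise]
        # A single-element list collapses to a scalar.
        return noise_range[0] if len(noise_range) == 1 else noise_range
    return lr_noise * num_epochs


def resolve_plateau_mode(args):
    """Hypothetical helper mirroring the 'plateau' mode selection."""
    # Metrics with 'loss' in the name are minimized; everything else is maximized.
    return 'min' if 'loss' in getattr(args, 'eval_metric', '') else 'max'


# Example values (assumed, for illustration only).
print(resolve_noise_range(Namespace(lr_noise=[0.5, 0.75]), 100))  # [50.0, 75.0]
print(resolve_noise_range(Namespace(lr_noise=[0.5]), 100))        # 50.0
print(resolve_noise_range(Namespace(), 100))                      # None
print(resolve_plateau_mode(Namespace(eval_metric='loss')))        # min
print(resolve_plateau_mode(Namespace(eval_metric='top1')))        # max
```

Note that a missing `eval_metric` falls through to `'max'`, so a config that tracks a loss under a name without the substring `loss` would be maximized.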