flash-attention/training/configs/scheduler/poly-warmup.yaml
2022-11-28 17:34:40 -08:00

3 lines
92 B
YAML

# @package train.scheduler
_target_: transformers.get_polynomial_decay_schedule_with_warmup