flash-attention/training/configs/scheduler/cosine-warmup.yaml
2022-11-28 17:34:40 -08:00

3 lines
82 B
YAML

# @package train.scheduler
_target_: transformers.get_cosine_schedule_with_warmup