flash-attention/training/configs/scheduler/invsqrt.yaml

4 lines
90 B
YAML
Raw Normal View History

2022-11-29 09:31:19 +08:00
# @package train.scheduler
_target_: src.optim.lr_scheduler.InvSqrt
num_warmup_steps: ???