flash-attention/training/configs/optimizer/adamw.yaml
2022-11-28 17:34:40 -08:00

3 lines
55 B
YAML

# @package train.optimizer
_target_: torch.optim.AdamW