flash-attention/training/configs/optimizer/fusedlamb-ds.yaml

3 lines
66 B
YAML
Raw Normal View History

2022-11-29 09:31:19 +08:00
# @package train.optimizer
_target_: deepspeed.ops.lamb.FusedLamb