flash-attention/tests/losses
2022-12-27 10:47:43 -08:00
test_cross_entropy_parallel.py    Tweak CrossEntropyLoss to take process_group in init                 2022-12-27 10:47:43 -08:00
test_cross_entropy.py             Add smoothing for CrossEntropyParallel, rename to CrossEntropyLoss   2022-12-23 14:51:08 -08:00