flash-attention/tests/losses
Curtis "Fjord" Hawthorne d8aacc510c
return z_loss (#768)
2024-01-21 15:23:41 -08:00
..
test_cross_entropy_parallel.py [CrossEntropy] Implement logit_scale option 2023-12-16 18:39:37 -08:00
test_cross_entropy.py return z_loss (#768) 2024-01-21 15:23:41 -08:00