flash-attention/tests/losses
2023-12-16 19:11:23 -08:00
test_cross_entropy_parallel.py    [CrossEntropy] Implement logit_scale option    2023-12-16 18:39:37 -08:00
test_cross_entropy.py             [CrossEntropy] Test longer sequences           2023-12-16 19:11:23 -08:00