flash-attention/tests/losses
2023-11-19 23:19:36 -08:00
..
test_cross_entropy_parallel.py [CrossEntropy] Simplify the case of large vocab with Tensor Parallel 2023-11-19 23:19:36 -08:00
test_cross_entropy.py [CE] Implement CrossEntropyLoss in Triton 2023-09-15 20:05:28 -07:00