flash-attention/tests/losses
2022-12-26 16:22:43 -08:00
test_cross_entropy_parallel.py  Implement Tensor Parallel for GPT model  2022-12-26 16:22:43 -08:00
test_cross_entropy.py           Add smoothing for CrossEntropyParallel, rename to CrossEntropyLoss  2022-12-23 14:51:08 -08:00