Tri Dao
|
c7f32a8409
|
[CrossEntropy] Support precomputed LSE
|
2024-09-08 09:24:43 -07:00 |
|
Curtis "Fjord" Hawthorne
|
d8aacc510c
|
return z_loss (#768)
|
2024-01-21 15:23:41 -08:00 |
|
Tri Dao
|
713bd3aa9a
|
[CrossEntropy] Test longer sequences
|
2023-12-16 19:11:23 -08:00 |
|
Tri Dao
|
08124c8f9c
|
[CrossEntropy] Implement logit_scale option
|
2023-12-16 18:39:37 -08:00 |
|
Tri Dao
|
5400fdc4ac
|
[CE] Implement CrossEntropyLoss in Triton
|
2023-09-15 20:05:28 -07:00 |
|
Tri Dao
|
0e8c46ae08
|
Run isort and black on test files
|
2023-08-18 20:59:35 -07:00 |
|
Tri Dao
|
dff68c2b22
|
Add smoothing for CrossEntropyParallel, rename to CrossEntropyLoss
|
2022-12-23 14:51:08 -08:00 |
|