Commit Graph

5 Commits

Author SHA1 Message Date
Tri Dao
009a3e71ec [Training] Fix lightning _PATH import 2023-03-29 01:43:39 -07:00
Tri Dao
71befc19e1 [Loss] Use flash_attn.losses.cross_entropy.CrossEntropyLoss 2022-12-31 22:43:28 -08:00
Tri Dao
b4018a5028 Implement Tensor Parallel for GPT model 2022-12-26 16:22:43 -08:00
Tri Dao
dff68c2b22 Add smoothing for CrossEntropyParallel, rename to CrossEntropyLoss 2022-12-23 14:51:08 -08:00
Tri Dao
0bf5e50038 Release training code 2022-11-28 17:34:40 -08:00