|
models
|
Implement last_layer_subset optimization for BERT
|
2022-12-19 22:18:46 -08:00 |
|
modules
|
Implement Tensor Parallel for GPT2Embeddings
|
2022-12-25 14:29:53 -08:00 |
|
test_flash_attn.py
|
Skip flash_attn_split test
|
2022-11-13 12:27:48 -08:00 |
|
test_rotary.py
|
Add MLP, MHA, Block, Embedding modules
|
2022-11-13 22:06:44 -08:00 |