| models | Implement last_layer_subset optimization for BERT | 2022-12-19 22:18:46 -08:00 |
| ops | Simplify FusedDense | 2022-12-22 21:25:31 -08:00 |
| test_flash_attn.py | Skip flash_attn_split test | 2022-11-13 12:27:48 -08:00 |
| test_rotary.py | Add MLP, MHA, Block, Embedding modules | 2022-11-13 22:06:44 -08:00 |