|
models
|
[Gen] Make generation work with Tensor Parallel
|
2023-01-15 11:34:27 -08:00 |
|
modules
|
[Gen] Make generation work with Tensor Parallel
|
2023-01-15 11:34:27 -08:00 |
|
utils
|
[Gen] Make generation work with Tensor Parallel
|
2023-01-15 11:34:27 -08:00 |
|
__init__.py
|
Add missing __init__.py
|
2022-07-03 02:04:55 -04:00 |
|
bert_padding.py
|
remove numpy dependency
|
2022-10-06 19:17:15 +02:00 |
|
flash_attention.py
|
Implement BERT
|
2022-12-18 21:47:27 -08:00 |
|
flash_blocksparse_attention.py
|
Rename src -> flash_attn
|
2022-06-01 18:50:26 -07:00 |