Commit Graph

6 Commits

Author SHA1 Message Date
janEbert
3c4053b75c
Make FA3 externally importable (#1053)
Library name to import is `flash_attn_interface`, which matches the
test.
2024-07-22 21:34:56 -07:00
Ying Zhang
dfe1a59e4b
Add var-seq-len to FA3 fp16 / bf16 fwd (#1072)
* fwd var-seq-len

* fixes

* benchmark

* fixes

---------

Co-authored-by: Tri Dao <tridao@users.noreply.github.com>
2024-07-22 21:32:41 -07:00
Cameron Shinn
cb516f855b
Remove torchlib dependency from cpp files (#1083) 2024-07-22 16:47:09 -07:00
youkaichao
ef3e358a25
remove lambda (#1056) 2024-07-21 23:24:38 -07:00
Tri Dao
74b0761ff7 [FA3] BF16 forward 2024-07-14 23:39:46 -07:00
Tri Dao
7f67966cc7 FA3 initial code release 2024-07-11 09:53:36 -07:00