flash-attention/csrc/flash_attn
Tri Dao 1aa6d7d9b6 Rework dropout to decouple forward and backward
They don't have to have the same block size, number of threads, etc.
2022-10-21 12:04:27 -07:00
..
cutlass@319a389f42 Add Cutlass as submodule 2022-06-02 09:54:16 -07:00
src Rework dropout to decouple forward and backward 2022-10-21 12:04:27 -07:00
fmha_api.cpp Rework dropout to decouple forward and backward 2022-10-21 12:04:27 -07:00