flash-attention/csrc/flash_attn
Kirthi Shankar Sivamani 45567a25a2 only 1 thread writes to global mem in fprop
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
2023-04-15 06:09:41 +00:00
cutlass@319a389f42    Add Cutlass as submodule    2022-06-02 09:54:16 -07:00
src                   only 1 thread writes to global mem in fprop    2023-04-15 06:09:41 +00:00
fmha_api.cpp          Handle FlashAttnQKVPackedSplitFunc by making rng_state optional in backward    2023-04-13 06:25:52 +00:00