flash-attention/csrc/flash_attn
Tri Dao 321c57d07d Set block size of SM75 fwd to 256 if there's no dropout
This speeds up the fwd by 1.5x.
2022-06-04 16:51:28 -07:00
cutlass@319a389f42  Add Cutlass as submodule                                 2022-06-02 09:54:16 -07:00
src                 Set block size of SM75 fwd to 256 if there's no dropout  2022-06-04 16:51:28 -07:00
fmha_api.cpp        Set block size of SM75 fwd to 256 if there's no dropout  2022-06-04 16:51:28 -07:00