flash-attention/csrc/flash_attn
2023-09-10 22:56:33 -07:00
src            Swap seqlen_q and nheads for MQA to speed it up (h/t Daniel Haziza)  2023-09-10 22:56:33 -07:00
flash_api.cpp  Swap seqlen_q and nheads for MQA to speed it up (h/t Daniel Haziza)  2023-09-10 22:56:33 -07:00