flash-attention/csrc/flash_attn/src
2024-01-19 16:09:31 -08:00
alibi.h Make Alibi an object 2024-01-15 00:07:11 -08:00
block_info.h Allow varlen_fwd to take optional seqused_k (#647) 2023-11-27 00:41:23 -08:00
dropout.h Move masking to a separate file (mask.h) 2024-01-14 12:43:47 -08:00
flash_bwd_hdim32_bf16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_bwd_hdim32_fp16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_bwd_hdim64_bf16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_bwd_hdim64_fp16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_bwd_hdim96_bf16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_bwd_hdim96_fp16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_bwd_hdim128_bf16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_bwd_hdim128_fp16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_bwd_hdim160_bf16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_bwd_hdim160_fp16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_bwd_hdim192_bf16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_bwd_hdim192_fp16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_bwd_hdim224_bf16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_bwd_hdim224_fp16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_bwd_hdim256_bf16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_bwd_hdim256_fp16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_bwd_kernel.h Make Alibi an object 2024-01-15 00:07:11 -08:00
flash_bwd_launch_template.h Move bwd preprocess kernels to a separate file 2024-01-14 16:57:03 -08:00
flash_bwd_preprocess_kernel.h Move bwd preprocess kernels to a separate file 2024-01-14 16:57:03 -08:00
flash_fwd_hdim32_bf16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_fwd_hdim32_fp16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_fwd_hdim64_bf16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_fwd_hdim64_fp16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_fwd_hdim96_bf16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_fwd_hdim96_fp16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_fwd_hdim128_bf16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_fwd_hdim128_fp16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_fwd_hdim160_bf16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_fwd_hdim160_fp16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_fwd_hdim192_bf16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_fwd_hdim192_fp16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_fwd_hdim224_bf16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_fwd_hdim224_fp16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_fwd_hdim256_bf16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_fwd_hdim256_fp16_sm80.cu Use generate_kernels.py script from Driss Guessous 2023-08-28 13:34:12 -07:00
flash_fwd_kernel.h Make Softmax an object 2024-01-19 16:09:31 -08:00
flash_fwd_launch_template.h Implement deterministic backward (thanks to Meituan) 2023-12-23 17:57:36 -08:00
flash_fwd_split_hdim32_bf16_sm80.cu Implement splitKV attention 2023-08-29 00:58:29 -07:00
flash_fwd_split_hdim32_fp16_sm80.cu Implement splitKV attention 2023-08-29 00:58:29 -07:00
flash_fwd_split_hdim64_bf16_sm80.cu Implement splitKV attention 2023-08-29 00:58:29 -07:00
flash_fwd_split_hdim64_fp16_sm80.cu Implement splitKV attention 2023-08-29 00:58:29 -07:00
flash_fwd_split_hdim96_bf16_sm80.cu Implement splitKV attention 2023-08-29 00:58:29 -07:00
flash_fwd_split_hdim96_fp16_sm80.cu Implement splitKV attention 2023-08-29 00:58:29 -07:00
flash_fwd_split_hdim128_bf16_sm80.cu Implement splitKV attention 2023-08-29 00:58:29 -07:00
flash_fwd_split_hdim128_fp16_sm80.cu Implement splitKV attention 2023-08-29 00:58:29 -07:00
flash_fwd_split_hdim160_bf16_sm80.cu Implement splitKV attention 2023-08-29 00:58:29 -07:00
flash_fwd_split_hdim160_fp16_sm80.cu Implement splitKV attention 2023-08-29 00:58:29 -07:00
flash_fwd_split_hdim192_bf16_sm80.cu Implement splitKV attention 2023-08-29 00:58:29 -07:00
flash_fwd_split_hdim192_fp16_sm80.cu Implement splitKV attention 2023-08-29 00:58:29 -07:00
flash_fwd_split_hdim224_bf16_sm80.cu Implement splitKV attention 2023-08-29 00:58:29 -07:00
flash_fwd_split_hdim224_fp16_sm80.cu Implement splitKV attention 2023-08-29 00:58:29 -07:00
flash_fwd_split_hdim256_bf16_sm80.cu Implement splitKV attention 2023-08-29 00:58:29 -07:00
flash_fwd_split_hdim256_fp16_sm80.cu Implement splitKV attention 2023-08-29 00:58:29 -07:00
flash.h Implement deterministic backward (thanks to Meituan) 2023-12-23 17:57:36 -08:00
generate_kernels.py Implement splitKV attention 2023-08-29 00:58:29 -07:00
kernel_traits_sm90.h FlashAttention-2 release 2023-07-17 06:21:34 -07:00
kernel_traits.h Move masking to a separate file (mask.h) 2024-01-14 12:43:47 -08:00
mask.h Move masking to a separate file (mask.h) 2024-01-14 12:43:47 -08:00
philox.cuh Remove dead code in philox.cuh 2024-01-13 02:02:03 -08:00
softmax.h Make Softmax an object 2024-01-19 16:09:31 -08:00
static_switch.h Fix compile error on MSVC 2023-07-19 08:04:57 +00:00
utils.h apply_dropout now takes tensor of rowcol layout 2024-01-14 01:03:23 -08:00
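
The 48 generated `.cu` files above follow a regular naming pattern: one file per combination of kernel direction (`bwd`, `fwd`, `fwd_split`), head dimension, and dtype, all targeting `sm80`. The sketch below only reproduces those file names from values read off this listing. It is an illustration of the pattern, not the contents of `generate_kernels.py`, and the names `DIRECTIONS`, `HEAD_DIMS`, `DTYPES`, `ARCH`, and `kernel_filenames` are hypothetical.

```python
from itertools import product

# Values read off the directory listing; the real generate_kernels.py
# may define or derive them differently (assumption).
DIRECTIONS = ["bwd", "fwd", "fwd_split"]
HEAD_DIMS = [32, 64, 96, 128, 160, 192, 224, 256]
DTYPES = ["bf16", "fp16"]
ARCH = "sm80"


def kernel_filenames():
    """Return one .cu file name per (direction, head dim, dtype) combination."""
    return [
        f"flash_{direction}_hdim{head_dim}_{dtype}_{ARCH}.cu"
        for direction, head_dim, dtype in product(DIRECTIONS, HEAD_DIMS, DTYPES)
    ]


if __name__ == "__main__":
    for name in kernel_filenames():
        print(name)
```

Splitting each combination into its own translation unit lets the build compile the kernel instantiations in parallel, which is a common reason for this kind of layout.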