flash-attention/csrc
2024-07-23 01:32:09 -07:00
..
composable_kernel@8182976c37 Support AMD ROCm on FlashAttention 2 (#1010) 2024-07-22 21:34:37 -07:00
cutlass@756c351b49 [FA3] BF16 forward 2024-07-14 23:39:46 -07:00
flash_attn Split bwd into more .cu files to speed up compilation 2024-07-23 01:32:09 -07:00
flash_attn_ck Support AMD ROCm on FlashAttention 2 (#1010) 2024-07-22 21:34:37 -07:00
ft_attention Make nvcc threads configurable via environment variable (#885) 2024-03-13 20:46:57 -07:00
fused_dense_lib Make nvcc threads configurable via environment variable (#885) 2024-03-13 20:46:57 -07:00
fused_softmax Make nvcc threads configurable via environment variable (#885) 2024-03-13 20:46:57 -07:00
layer_norm Make nvcc threads configurable via environment variable (#885) 2024-03-13 20:46:57 -07:00
rotary Make nvcc threads configurable via environment variable (#885) 2024-03-13 20:46:57 -07:00
xentropy Make nvcc threads configurable via environment variable (#885) 2024-03-13 20:46:57 -07:00