flash-attention/csrc
Nicolas Patry 5bf201966a
Fixing argument checking when using seqlenq_ngroups_swapped. (#976)
When user send `out` as a parameter of the function
`seqlenq_ngroups_swapped` with parameters that trigger,
the CHECK_SHAPE is incorrect (since q shape is modified.)
2024-06-30 22:39:22 -07:00
..
cutlass@7d49e6c7e2 Update to Cutlass 3.5 2024-05-26 12:49:33 -07:00
flash_attn Fixing argument checking when using seqlenq_ngroups_swapped. (#976) 2024-06-30 22:39:22 -07:00
ft_attention Make nvcc threads configurable via environment variable (#885) 2024-03-13 20:46:57 -07:00
fused_dense_lib Make nvcc threads configurable via environment variable (#885) 2024-03-13 20:46:57 -07:00
fused_softmax Make nvcc threads configurable via environment variable (#885) 2024-03-13 20:46:57 -07:00
layer_norm Make nvcc threads configurable via environment variable (#885) 2024-03-13 20:46:57 -07:00
rotary Make nvcc threads configurable via environment variable (#885) 2024-03-13 20:46:57 -07:00
xentropy Make nvcc threads configurable via environment variable (#885) 2024-03-13 20:46:57 -07:00