cutlass/examples/41_fused_multi_head_attention
dan_the_3rd 146d314057
Update fMHA kernels (#992)
* Update fMHA kernels

Upstream recent changes to fMHA that we did in xFormers.
Previous version in CUTLASS: facebookresearch/xformers@b6be33a
Updating to: facebookresearch/xformers@55a4798

* minor changes

* make var work

---------

Co-authored-by: danthe3rd <danthe3rd>
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
2023-07-12 22:30:46 -04:00
Name                                          Last commit                         Date
epilogue                                      fMHA: Sync FW with xFormers (#828)  2023-02-22 23:25:31 -05:00
gemm                                          Update fMHA kernels (#992)          2023-07-12 22:30:46 -04:00
iterators                                     Update fMHA kernels (#992)          2023-07-12 22:30:46 -04:00
transform                                     Update fMHA kernels (#992)          2023-07-12 22:30:46 -04:00
CMakeLists.txt                                fMHA: Add backward pass (#844)      2023-04-06 20:44:58 -04:00
debug_utils.h                                 fMHA: Sync FW with xFormers (#828)  2023-02-22 23:25:31 -05:00
default_fmha_grouped.h                        Update fMHA kernels (#992)          2023-07-12 22:30:46 -04:00
fmha_backward_test.py                         Update fMHA kernels (#992)          2023-07-12 22:30:46 -04:00
fmha_grouped_problem_visitor.h                New updates for 2.11 (#775)         2023-01-20 16:32:57 -05:00
fmha_grouped.h                                Update fMHA kernels (#992)          2023-07-12 22:30:46 -04:00
fused_multi_head_attention_backward.cu        Update fMHA kernels (#992)          2023-07-12 22:30:46 -04:00
fused_multihead_attention_fixed_seqlen.cu     Update fMHA kernels (#992)          2023-07-12 22:30:46 -04:00
fused_multihead_attention_variable_seqlen.cu  Update fMHA kernels (#992)          2023-07-12 22:30:46 -04:00
gemm_kernel_utils.h                           Update fMHA kernels (#992)          2023-07-12 22:30:46 -04:00
kernel_backward.h                             Update fMHA kernels (#992)          2023-07-12 22:30:46 -04:00
kernel_forward.h                              Update fMHA kernels (#992)          2023-07-12 22:30:46 -04:00
piped_subprocess.py                           fMHA: Add backward pass (#844)      2023-04-06 20:44:58 -04:00