cutlass/examples/41_fused_multi_head_attention
Haicheng Wu 3f2bb17722
minor chagnes (#730)
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
2022-12-10 14:44:53 -05:00
..
gemm releaase 2.11 (#703) 2022-11-19 09:02:15 -05:00
iterators releaase 2.11 (#703) 2022-11-19 09:02:15 -05:00
attention_scaling_coefs_updater.h releaase 2.11 (#703) 2022-11-19 09:02:15 -05:00
CMakeLists.txt releaase 2.11 (#703) 2022-11-19 09:02:15 -05:00
debug_utils.h releaase 2.11 (#703) 2022-11-19 09:02:15 -05:00
default_fmha_grouped.h releaase 2.11 (#703) 2022-11-19 09:02:15 -05:00
epilogue_pipelined.h releaase 2.11 (#703) 2022-11-19 09:02:15 -05:00
epilogue_rescale_output.h releaase 2.11 (#703) 2022-11-19 09:02:15 -05:00
epilogue_thread_apply_logsumexp.h releaase 2.11 (#703) 2022-11-19 09:02:15 -05:00
find_default_mma.h releaase 2.11 (#703) 2022-11-19 09:02:15 -05:00
fmha_grouped_problem_visitor.h releaase 2.11 (#703) 2022-11-19 09:02:15 -05:00
fmha_grouped.h releaase 2.11 (#703) 2022-11-19 09:02:15 -05:00
fused_multihead_attention_fixed_seqlen.cu minor chagnes (#730) 2022-12-10 14:44:53 -05:00
fused_multihead_attention_variable_seqlen.cu minor chagnes (#730) 2022-12-10 14:44:53 -05:00
gemm_kernel_utils.h releaase 2.11 (#703) 2022-11-19 09:02:15 -05:00
kernel_forward.h minor chagnes (#730) 2022-12-10 14:44:53 -05:00
mma_from_smem.h releaase 2.11 (#703) 2022-11-19 09:02:15 -05:00