cutlass/include/cutlass/gemm/device
Yujia Zhai cc3c29a81a
CUTLASS 3.6.0 (#1850)
* v3.6

* update changelog

* update readme

* fix typo

* fixing typos

* hopper gemm with weight prefetch

---------

Co-authored-by: yuzhai <yuzhai@nvidia.com>
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
2024-10-09 15:33:27 -04:00
..
base_grouped.h CUTLASS 3.6.0 (#1850) 2024-10-09 15:33:27 -04:00
default_gemm_configuration.h CUTLASS 3.6.0 (#1850) 2024-10-09 15:33:27 -04:00
ell_gemm.h CUTLASS 3.6.0 (#1850) 2024-10-09 15:33:27 -04:00
gemm_array.h CUTLASS 3.6.0 (#1850) 2024-10-09 15:33:27 -04:00
gemm_batched.h CUTLASS 3.6.0 (#1850) 2024-10-09 15:33:27 -04:00
gemm_complex.h CUTLASS 3.6.0 (#1850) 2024-10-09 15:33:27 -04:00
gemm_grouped.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
gemm_layernorm_mainloop_fusion.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
gemm_sparse_universal_with_absmax.h CUTLASS 3.5.1 (#1623) 2024-07-29 08:46:24 -04:00
gemm_sparse_universal.h CUTLASS 3.5.1 (#1623) 2024-07-29 08:46:24 -04:00
gemm_sparse_with_absmax.h CUTLASS 3.6.0 (#1850) 2024-10-09 15:33:27 -04:00
gemm_sparse_with_visitor.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
gemm_sparse.h CUTLASS 3.6.0 (#1850) 2024-10-09 15:33:27 -04:00
gemm_splitk_parallel.h CUTLASS 3.6.0 (#1850) 2024-10-09 15:33:27 -04:00
gemm_universal_adapter.h CUTLASS 3.6.0 (#1850) 2024-10-09 15:33:27 -04:00
gemm_universal_base.h CUTLASS 3.6.0 (#1850) 2024-10-09 15:33:27 -04:00
gemm_universal_streamk_with_broadcast.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
gemm_universal_with_absmax.h CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
gemm_universal_with_broadcast.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
gemm_universal.h CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
gemm_with_k_reduction.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
gemm.h CUTLASS 3.6.0 (#1850) 2024-10-09 15:33:27 -04:00
gemv.h CUTLASS 3.6.0 (#1850) 2024-10-09 15:33:27 -04:00
rank_2k_grouped.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
rank_2k.h CUTLASS 3.6.0 (#1850) 2024-10-09 15:33:27 -04:00
rank_k.h CUTLASS 3.6.0 (#1850) 2024-10-09 15:33:27 -04:00
symm.h CUTLASS 3.6.0 (#1850) 2024-10-09 15:33:27 -04:00
trmm.h CUTLASS 3.6.0 (#1850) 2024-10-09 15:33:27 -04:00