.. |
00_basic_gemm
|
Fix typos 2 (#842)
|
2023-03-09 23:22:56 -05:00 |
01_cutlass_utilities
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
02_dump_reg_shmem
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
03_visualize_layout
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
04_tile_iterator
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
05_batched_gemm
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
06_splitK_gemm
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
07_volta_tensorop_gemm
|
Fix typos 2 (#842)
|
2023-03-09 23:22:56 -05:00 |
08_turing_tensorop_gemm
|
CUTLASS 3.4.0 (#1286)
|
2023-12-29 15:21:31 -05:00 |
09_turing_tensorop_conv2dfprop
|
CUTLASS 3.2.1 (#1113)
|
2023-09-26 17:24:26 -04:00 |
10_planar_complex
|
CUTLASS 3.2 (#1024)
|
2023-08-07 20:50:32 -04:00 |
11_planar_complex_array
|
CUTLASS 3.2 (#1024)
|
2023-08-07 20:50:32 -04:00 |
12_gemm_bias_relu
|
fix typo (#1279)
|
2024-01-04 12:41:39 -05:00 |
13_two_tensor_op_fusion
|
CUTLASS 3.2.1 (#1113)
|
2023-09-26 17:24:26 -04:00 |
14_ampere_tf32_tensorop_gemm
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
15_ampere_sparse_tensorop_gemm
|
Add support for sparse GEMM with visitor epilogue (#1189)
|
2024-01-04 12:38:11 -05:00 |
16_ampere_tensorop_conv2dfprop
|
style(examples): typo (#1080)
|
2023-09-11 10:13:22 -04:00 |
17_fprop_per_channel_bias
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
18_ampere_fp64_tensorop_affine2_gemm
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
19_tensorop_canonical
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
20_simt_canonical
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
21_quaternion_gemm
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
22_quaternion_conv
|
CUTLASS 3.1 (#915)
|
2023-04-14 23:19:34 -04:00 |
23_ampere_gemm_operand_reduction_fusion
|
CUTLASS 3.4.0 (#1286)
|
2023-12-29 15:21:31 -05:00 |
24_gemm_grouped
|
CUTLASS 3.2.1 (#1113)
|
2023-09-26 17:24:26 -04:00 |
25_ampere_fprop_mainloop_fusion
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
26_ampere_wgrad_mainloop_fusion
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
27_ampere_3xtf32_fast_accurate_tensorop_gemm
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
28_ampere_3xtf32_fast_accurate_tensorop_fprop
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
29_ampere_3xtf32_fast_accurate_tensorop_complex_gemm
|
CUTLASS 3.1 (#915)
|
2023-04-14 23:19:34 -04:00 |
30_wgrad_split_k
|
CUTLASS 3.4.0 (#1286)
|
2023-12-29 15:21:31 -05:00 |
31_basic_syrk
|
Updates for 3.1 (#932)
|
2023-04-29 09:34:27 -04:00 |
32_basic_trmm
|
Updates for 3.1 (#932)
|
2023-04-29 09:34:27 -04:00 |
33_ampere_3xtf32_tensorop_symm
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
34_transposed_conv2d
|
CUTLASS 3.4.0 (#1286)
|
2023-12-29 15:21:31 -05:00 |
35_gemm_softmax
|
Increase max dynamic SMEM size in GemmSoftmax (#903)
|
2023-04-03 10:01:12 -04:00 |
36_gather_scatter_fusion
|
CUTLASS 3.2 (#1024)
|
2023-08-07 20:50:32 -04:00 |
37_gemm_layernorm_gemm_fusion
|
CUTLASS 3.0.0 (#786)
|
2023-01-23 20:55:28 -05:00 |
38_syr2k_grouped
|
CUTLASS 3.4.0 (#1286)
|
2023-12-29 15:21:31 -05:00 |
39_gemm_permute
|
CUTLASS 3.2 (#1024)
|
2023-08-07 20:50:32 -04:00 |
40_cutlass_py
|
CUTLASS 3.2.1 (#1113)
|
2023-09-26 17:24:26 -04:00 |
41_fused_multi_head_attention
|
Adding missing typename (#1191)
|
2023-11-29 00:20:20 -05:00 |
42_ampere_tensorop_group_conv
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
43_ell_block_sparse_gemm
|
New updates for 2.11 (#775)
|
2023-01-20 16:32:57 -05:00 |
44_multi_gemm_ir_and_codegen
|
Fix several typos (#1169)
|
2023-11-02 23:54:46 -04:00 |
45_dual_gemm
|
Replace 0x1f with 0xffffffff in __shfl_sync (#1097)
|
2023-09-18 19:58:19 -04:00 |
46_depthwise_simt_conv2dfprop
|
CUTLASS 3.4.0 (#1286)
|
2023-12-29 15:21:31 -05:00 |
47_ampere_gemm_universal_streamk
|
CUTLASS 3.3.0 (#1167)
|
2023-11-02 11:09:05 -04:00 |
48_hopper_warp_specialized_gemm
|
CUTLASS 3.3.0 (#1167)
|
2023-11-02 11:09:05 -04:00 |
49_hopper_gemm_with_collective_builder
|
CUTLASS 3.4.0 (#1286)
|
2023-12-29 15:21:31 -05:00 |
50_hopper_gemm_with_epilogue_swizzle
|
CUTLASS 3.2 (#1024)
|
2023-08-07 20:50:32 -04:00 |
51_hopper_gett
|
Collection of changes to fix clang build. (#1200)
|
2023-12-08 14:42:12 -05:00 |
52_hopper_gather_scatter_fusion
|
CUTLASS 3.4.0 (#1286)
|
2023-12-29 15:21:31 -05:00 |
53_hopper_gemm_permute
|
CUTLASS 3.4.0 (#1286)
|
2023-12-29 15:21:31 -05:00 |
54_hopper_fp8_warp_specialized_gemm
|
CUTLASS 3.4.0 (#1286)
|
2023-12-29 15:21:31 -05:00 |
55_hopper_mixed_dtype_gemm
|
CUTLASS 3.4.0 (#1286)
|
2023-12-29 15:21:31 -05:00 |
56_hopper_ptr_array_batched_gemm
|
CUTLASS 3.4.0 (#1286)
|
2023-12-29 15:21:31 -05:00 |
57_hopper_grouped_gemm
|
CUTLASS 3.4.0 (#1286)
|
2023-12-29 15:21:31 -05:00 |
60_cutlass_import
|
CUTLASS 3.4.0 (#1286)
|
2023-12-29 15:21:31 -05:00 |
common
|
CUTLASS 3.1 (#915)
|
2023-04-14 23:19:34 -04:00 |
cute
|
CUTLASS 3.4.0 (#1286)
|
2023-12-29 15:21:31 -05:00 |
python
|
CUTLASS 3.4.0 (#1286)
|
2023-12-29 15:21:31 -05:00 |
CMakeLists.txt
|
CUTLASS 3.4.0 (#1286)
|
2023-12-29 15:21:31 -05:00 |