cutlass/examples
Exusial 310ed81ac3
fix description in example 12. (#444)
Co-authored-by: Exusial <Exusial>
2022-04-24 16:29:06 -04:00
00_basic_gemm CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
01_cutlass_utilities CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
02_dump_reg_shmem CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
03_visualize_layout CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
04_tile_iterator CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
05_batched_gemm CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
06_splitK_gemm CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
07_volta_tensorop_gemm CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
08_turing_tensorop_gemm CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
09_turing_tensorop_conv2dfprop CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
10_planar_complex CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
11_planar_complex_array CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
12_gemm_bias_relu fix description in example 12. (#444) 2022-04-24 16:29:06 -04:00
13_two_tensor_op_fusion CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
14_ampere_tf32_tensorop_gemm CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
15_ampere_sparse_tensorop_gemm CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
16_ampere_tensorop_conv2dfprop CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
17_fprop_per_channel_bias CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
18_ampere_fp64_tensorop_affine2_gemm CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
19_tensorop_canonical CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
20_simt_canonical CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
21_quaternion_gemm CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
22_quaternion_conv CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
23_ampere_gemm_operand_reduction_fusion CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
24_gemm_grouped CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
25_ampere_fprop_mainloop_fusion CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
26_ampere_wgrad_mainloop_fusion CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
27_ampere_3xtf32_fast_accurate_tensorop_gemm CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
28_ampere_3xtf32_fast_accurate_tensorop_fprop CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
29_ampere_3xtf32_fast_accurate_tensorop_complex_gemm CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
30_wgrad_split_k CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
31_basic_syrk CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
32_basic_trmm CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
33_ampere_3xtf32_tensorop_symm CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
34_transposed_conv2d CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
35_gemm_softmax CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
36_gather_scatter_fusion CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
40_cutlass_py CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00
common CUTLASS 2.0 (#62) 2019-11-19 16:55:34 -08:00
CMakeLists.txt CUTLASS 2.9 (#468) 2022-04-23 15:02:38 -04:00