00_basic_gemm | bug fixes | 2021-06-02 10:08:25 -07:00 |
01_cutlass_utilities | CUTLASS 2.5 | 2021-02-26 09:58:26 -05:00 |
02_dump_reg_shmem | CUTLASS 2.5 | 2021-02-26 09:58:26 -05:00 |
03_visualize_layout | CUTLASS 2.5 | 2021-02-26 09:58:26 -05:00 |
04_tile_iterator | CUTLASS 2.5 | 2021-02-26 09:58:26 -05:00 |
05_batched_gemm | CUTLASS 2.5 | 2021-02-26 09:58:26 -05:00 |
06_splitK_gemm | CUTLASS 2.5 | 2021-02-26 09:58:26 -05:00 |
07_volta_tensorop_gemm | CUTLASS 2.5 | 2021-02-26 09:58:26 -05:00 |
08_turing_tensorop_gemm | CUTLASS 2.5 | 2021-02-26 09:58:26 -05:00 |
09_turing_tensorop_conv2dfprop | CUTLASS 2.5 | 2021-02-26 09:58:26 -05:00 |
10_planar_complex | CUTLASS 2.5 | 2021-02-26 09:58:26 -05:00 |
11_planar_complex_array | CUTLASS 2.5 | 2021-02-26 09:58:26 -05:00 |
12_gemm_bias_relu | CUTLASS 2.5 | 2021-02-26 09:58:26 -05:00 |
13_two_tensor_op_fusion | CUTLASS 2.5 | 2021-02-26 09:58:26 -05:00 |
14_ampere_tf32_tensorop_gemm | CUTLASS 2.5 | 2021-02-26 09:58:26 -05:00 |
15_ampere_sparse_tensorop_gemm | fix a wrong description | 2021-04-22 20:28:28 +08:00 |
16_ampere_tensorop_conv2dfprop | CUTLASS 2.5 | 2021-02-26 09:58:26 -05:00 |
17_fprop_per_channel_bias | CUTLASS 2.5 | 2021-02-26 09:58:26 -05:00 |
common | CUTLASS 2.0 (#62) | 2019-11-19 16:55:34 -08:00 |
CMakeLists.txt | CUTLASS 2.5 | 2021-02-26 09:58:26 -05:00 |