cutlass/examples
Manish Gupta 6615010cd0
CUTLASS 2.4 (Implicit GEMM convolution) (#147)
CUTLASS 2.4 (Implicit GEMM Convolution)

Co-authored-by: Manish Gupta <manigupta@nvidia.com>, Haicheng Wu <haichengw@nvidia.com>, Dustyn Blasig <dblasig@nvidia.com>, Andrew Kerr <akerr@nvidia.com>
2020-11-19 21:25:25 -08:00
..
00_basic_gemm CUTLASS 2.2 (#96) 2020-06-08 16:17:35 -07:00
01_cutlass_utilities CUTLASS 2.2 (#96) 2020-06-08 16:17:35 -07:00
02_dump_reg_shmem CUTLASS 2.3 initial commit (#134) 2020-09-23 14:00:58 -07:00
03_visualize_layout CUTLASS 2.4 (Implicit GEMM convolution) (#147) 2020-11-19 21:25:25 -08:00
04_tile_iterator CUTLASS 2.2 (#96) 2020-06-08 16:17:35 -07:00
05_batched_gemm CUTLASS 2.2 (#96) 2020-06-08 16:17:35 -07:00
06_splitK_gemm Typoes (#107) 2020-07-13 14:25:52 -07:00
07_volta_tensorop_gemm Updated mma_sm80.h to avoid perf penalty due to reinterpret_cast<>. (#100) 2020-06-15 10:47:01 -07:00
08_turing_tensorop_gemm CUTLASS 2.4 (Implicit GEMM convolution) (#147) 2020-11-19 21:25:25 -08:00
09_turing_tensorop_conv2dfprop CUTLASS 2.4 (Implicit GEMM convolution) (#147) 2020-11-19 21:25:25 -08:00
10_planar_complex CUTLASS 2.3 initial commit (#134) 2020-09-23 14:00:58 -07:00
11_planar_complex_array CUTLASS 2.3 initial commit (#134) 2020-09-23 14:00:58 -07:00
12_gemm_bias_relu CUTLASS 2.4 (Implicit GEMM convolution) (#147) 2020-11-19 21:25:25 -08:00
13_fused_two_gemms CUTLASS 2.4 (Implicit GEMM convolution) (#147) 2020-11-19 21:25:25 -08:00
14_ampere_tf32_tensorop_gemm CUTLASS 2.4 (Implicit GEMM convolution) (#147) 2020-11-19 21:25:25 -08:00
15_ampere_sparse_tensorop_gemm CUTLASS 2.4 (Implicit GEMM convolution) (#147) 2020-11-19 21:25:25 -08:00
22_ampere_tensorop_conv2dfprop CUTLASS 2.4 (Implicit GEMM convolution) (#147) 2020-11-19 21:25:25 -08:00
common CUTLASS 2.0 (#62) 2019-11-19 16:55:34 -08:00
CMakeLists.txt CUTLASS 2.4 (Implicit GEMM convolution) (#147) 2020-11-19 21:25:25 -08:00