cutlass

History

Manish Gupta 6615010cd0 CUTLASS 2.4 (Implicit GEMM convolution) (#147 ) CUTLASS 2.4 (Implicit GEMM Convolution) Co-authored-by: Manish Gupta <manigupta@nvidia.com>, Haicheng Wu <haichengw@nvidia.com>, Dustyn Blasig <dblasig@nvidia.com>, Andrew Kerr <akerr@nvidia.com>		2020-11-19 21:25:25 -08:00
..
00_basic_gemm	CUTLASS 2.2 (#96 )	2020-06-08 16:17:35 -07:00
01_cutlass_utilities	CUTLASS 2.2 (#96 )	2020-06-08 16:17:35 -07:00
02_dump_reg_shmem	CUTLASS 2.3 initial commit (#134 )	2020-09-23 14:00:58 -07:00
03_visualize_layout	CUTLASS 2.4 (Implicit GEMM convolution) (#147 )	2020-11-19 21:25:25 -08:00
04_tile_iterator	CUTLASS 2.2 (#96 )	2020-06-08 16:17:35 -07:00
05_batched_gemm	CUTLASS 2.2 (#96 )	2020-06-08 16:17:35 -07:00
06_splitK_gemm	Typoes (#107 )	2020-07-13 14:25:52 -07:00
07_volta_tensorop_gemm	Updated mma_sm80.h to avoid perf penalty due to reinterpret_cast<>. (#100 )	2020-06-15 10:47:01 -07:00
08_turing_tensorop_gemm	CUTLASS 2.4 (Implicit GEMM convolution) (#147 )	2020-11-19 21:25:25 -08:00
09_turing_tensorop_conv2dfprop	CUTLASS 2.4 (Implicit GEMM convolution) (#147 )	2020-11-19 21:25:25 -08:00
10_planar_complex	CUTLASS 2.3 initial commit (#134 )	2020-09-23 14:00:58 -07:00
11_planar_complex_array	CUTLASS 2.3 initial commit (#134 )	2020-09-23 14:00:58 -07:00
12_gemm_bias_relu	CUTLASS 2.4 (Implicit GEMM convolution) (#147 )	2020-11-19 21:25:25 -08:00
13_fused_two_gemms	CUTLASS 2.4 (Implicit GEMM convolution) (#147 )	2020-11-19 21:25:25 -08:00
14_ampere_tf32_tensorop_gemm	CUTLASS 2.4 (Implicit GEMM convolution) (#147 )	2020-11-19 21:25:25 -08:00
15_ampere_sparse_tensorop_gemm	CUTLASS 2.4 (Implicit GEMM convolution) (#147 )	2020-11-19 21:25:25 -08:00
22_ampere_tensorop_conv2dfprop	CUTLASS 2.4 (Implicit GEMM convolution) (#147 )	2020-11-19 21:25:25 -08:00
common	CUTLASS 2.0 (#62 )	2019-11-19 16:55:34 -08:00
CMakeLists.txt	CUTLASS 2.4 (Implicit GEMM convolution) (#147 )	2020-11-19 21:25:25 -08:00