cutlass

History

Andrew Kerr c53f3339bb CUTLASS 2.3 initial commit (#134 ) CUTLASS 2.3 adds GEMMs targeting Sparse Tensor Cores on the NVIDIA Ampere Architecture, fast SGEMM, and small matrix classes, bug fixes, and performance enhancements.		2020-09-23 14:00:58 -07:00
..
device	CUTLASS 2.2 (#96 )	2020-06-08 16:17:35 -07:00
kernel	CUTLASS 2.3 initial commit (#134 )	2020-09-23 14:00:58 -07:00
threadblock	CUTLASS 2.3 initial commit (#134 )	2020-09-23 14:00:58 -07:00
b2b_gemm_f16t_f16n_f16t_tensor_op_f16_sm75.h	CUTLASS 2.2 (#96 )	2020-06-08 16:17:35 -07:00
b2b_gemm_run.h	CUTLASS 2.2 (#96 )	2020-06-08 16:17:35 -07:00
b2b_gemm_s8n_s8t_s8n_tensor_op_s32_sm75.h	CUTLASS 2.2 (#96 )	2020-06-08 16:17:35 -07:00
b2b_gemm_s8n_s8t_s8n_tensor_op_s32_sm80.h	CUTLASS 2.3 initial commit (#134 )	2020-09-23 14:00:58 -07:00
b2b_interleaved_gemm_run.h	CUTLASS 2.3 initial commit (#134 )	2020-09-23 14:00:58 -07:00
CMakeLists.txt	CUTLASS 2.2 (#96 )	2020-06-08 16:17:35 -07:00
fused_gemm.cu	CUTLASS 2.3 initial commit (#134 )	2020-09-23 14:00:58 -07:00