cutlass/examples/13_fused_two_gemms
Manish Gupta 6615010cd0
CUTLASS 2.4 (Implicit GEMM convolution) (#147)
CUTLASS 2.4 (Implicit GEMM Convolution)

Co-authored-by: Manish Gupta <manigupta@nvidia.com>, Haicheng Wu <haichengw@nvidia.com>, Dustyn Blasig <dblasig@nvidia.com>, Andrew Kerr <akerr@nvidia.com>
2020-11-19 21:25:25 -08:00
..
device CUTLASS 2.2 (#96) 2020-06-08 16:17:35 -07:00
kernel CUTLASS 2.4 (Implicit GEMM convolution) (#147) 2020-11-19 21:25:25 -08:00
threadblock CUTLASS 2.3 initial commit (#134) 2020-09-23 14:00:58 -07:00
b2b_gemm_f16t_f16n_f16t_tensor_op_f16_sm75.h CUTLASS 2.2 (#96) 2020-06-08 16:17:35 -07:00
b2b_gemm_run.h CUTLASS 2.2 (#96) 2020-06-08 16:17:35 -07:00
b2b_gemm_s8n_s8t_s8n_tensor_op_s32_sm75.h CUTLASS 2.2 (#96) 2020-06-08 16:17:35 -07:00
b2b_gemm_s8n_s8t_s8n_tensor_op_s32_sm80.h CUTLASS 2.3 initial commit (#134) 2020-09-23 14:00:58 -07:00
b2b_interleaved_gemm_run.h CUTLASS 2.3 initial commit (#134) 2020-09-23 14:00:58 -07:00
CMakeLists.txt CUTLASS 2.2 (#96) 2020-06-08 16:17:35 -07:00
fused_gemm.cu CUTLASS 2.4 (Implicit GEMM convolution) (#147) 2020-11-19 21:25:25 -08:00