![]() CUTLASS 2.4 (Implicit GEMM Convolution) Co-authored-by: Manish Gupta <manigupta@nvidia.com>, Haicheng Wu <haichengw@nvidia.com>, Dustyn Blasig <dblasig@nvidia.com>, Andrew Kerr <akerr@nvidia.com> |
||
---|---|---|
.. | ||
device | ||
kernel | ||
threadblock | ||
b2b_gemm_f16t_f16n_f16t_tensor_op_f16_sm75.h | ||
b2b_gemm_run.h | ||
b2b_gemm_s8n_s8t_s8n_tensor_op_s32_sm75.h | ||
b2b_gemm_s8n_s8t_s8n_tensor_op_s32_sm80.h | ||
b2b_interleaved_gemm_run.h | ||
CMakeLists.txt | ||
fused_gemm.cu |