![]() CUTLASS 1.3 Release - Efficient GEMM kernel targeting Volta Tensor Cores via mma.sync instruction added in CUDA 10.1. |
||
---|---|---|
.. | ||
CMakeLists.txt | ||
strided_batched_gemm.cu |
![]() CUTLASS 1.3 Release - Efficient GEMM kernel targeting Volta Tensor Cores via mma.sync instruction added in CUDA 10.1. |
||
---|---|---|
.. | ||
CMakeLists.txt | ||
strided_batched_gemm.cu |