cutlass/examples/03_strided_batched_gemm
Andrew Kerr 877bdcace6
Cutlass 1.3 Release (#42)
CUTLASS 1.3 Release
- Efficient GEMM kernel targeting Volta Tensor Cores via mma.sync instruction added in CUDA 10.1.
2019-03-20 10:49:17 -07:00
..
CMakeLists.txt Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
strided_batched_gemm.cu Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00