cutlass/tools/test/perf/gemm
Andrew Kerr 877bdcace6
Cutlass 1.3 Release (#42)
CUTLASS 1.3 Release
- Efficient GEMM kernel targeting Volta Tensor Cores via mma.sync instruction added in CUDA 10.1.
2019-03-20 10:49:17 -07:00
..
cublas_dispatch.h Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
cutlass_dispatch_splitK_PI.h Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
cutlass_dispatch.h Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
cutlass_volta884_dispatch_splitK_PI.h Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
cutlass_volta884_dispatch.h Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
dgemm.cu Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
gemm_perf_testbed.h Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
gemm_profiler.h Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
hgemm.cu Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
igemm_splitK.cu Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
igemm.cu Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
sgemm_splitK.cu Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
sgemm.cu Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
volta884_gemm_cta_rasterization_nn.cu Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
volta884_gemm_cta_rasterization_nt.cu Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
volta884_gemm_cta_rasterization_tn.cu Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
volta884_gemm_cta_rasterization_tt.cu Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
volta884_gemm_splitK.cu Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
volta884_gemm.cu Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
wmma_binary_gemm.cu Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
wmma_gemm.cu Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00
wmma_integer_gemm.cu Cutlass 1.3 Release (#42) 2019-03-20 10:49:17 -07:00