cutlass/test/unit/gemm/threadblock
Andrew Kerr 96dab34ad9
CUTLASS 2.1 (#83)
CUTLASS 2.1 contributes:
- BLAS-style host-side API added to CUTLASS Library
- Planar Complex GEMM kernels targeting Volta and Turing Tensor Cores
- Minor enhancements and bug fixes
2020-04-07 13:51:25 -07:00
Name                          Last commit        Date
batched_gemv.cu               CUTLASS 2.0 (#62)  2019-11-19 16:55:34 -08:00
CMakeLists.txt                CUTLASS 2.0 (#62)  2019-11-19 16:55:34 -08:00
epilogue_workspace.cu         CUTLASS 2.0 (#62)  2019-11-19 16:55:34 -08:00
mma_pipelined_simt.cu         CUTLASS 2.0 (#62)  2019-11-19 16:55:34 -08:00
mma_pipelined_sm70.cu         CUTLASS 2.0 (#62)  2019-11-19 16:55:34 -08:00
mma_pipelined_sm75.cu         CUTLASS 2.0 (#62)  2019-11-19 16:55:34 -08:00
mma_pipelined_testbed.h       CUTLASS 2.0 (#62)  2019-11-19 16:55:34 -08:00
mma_pipelined_wmma_sm70.cu    CUTLASS 2.0 (#62)  2019-11-19 16:55:34 -08:00
mma_pipelined_wmma_sm75.cu    CUTLASS 2.0 (#62)  2019-11-19 16:55:34 -08:00
mma_planar_complex_testbed.h  CUTLASS 2.1 (#83)  2020-04-07 13:51:25 -07:00
mma_singlestage_wmma_sm70.cu  CUTLASS 2.0 (#62)  2019-11-19 16:55:34 -08:00
mma_singlestage_wmma_sm75.cu  CUTLASS 2.0 (#62)  2019-11-19 16:55:34 -08:00