CUTLASS 2.1 contributes: - BLAS-style host-side API added to CUTLASS Library - Planar Complex GEMM kernels targeting Volta and Turing Tensor Cores - Minor enhancements and bug fixes |
||
|---|---|---|
| .. | ||
| device | ||
| thread | ||
| threadblock | ||
| warp | ||
| CMakeLists.txt | ||
CUTLASS 2.1 contributes: - BLAS-style host-side API added to CUTLASS Library - Planar Complex GEMM kernels targeting Volta and Turing Tensor Cores - Minor enhancements and bug fixes |
||
|---|---|---|
| .. | ||
| device | ||
| thread | ||
| threadblock | ||
| warp | ||
| CMakeLists.txt | ||