CUTLASS 2.1 contributes:
- BLAS-style host-side API added to CUTLASS Library
- Planar Complex GEMM kernels targeting Volta and Turing Tensor Cores
- Minor enhancements and bug fixes

| File |
|---|
| gemm_operation.h |
| handle.cu |
| library_internal.h |
| manifest.cpp |
| operation_table.cu |
| singleton.cu |
| util.cu |