CUTLASS 2.1 contributes: - BLAS-style host-side API added to CUTLASS Library - Planar Complex GEMM kernels targeting Volta and Turing Tensor Cores - Minor enhancements and bug fixes |
||
|---|---|---|
| .. | ||
| code_organization.md | ||
| doxygen_mainpage.md | ||
| efficient_gemm.md | ||
| functionality.md | ||
| fundamental_types.md | ||
| gemm_api.md | ||
| layout.md | ||
| profiler.md | ||
| programming_guidelines.md | ||
| quickstart.md | ||
| terminology.md | ||
| tile_iterator_concept.md | ||
| utilities.md | ||