CUTLASS 2.3 adds GEMMs targeting Sparse Tensor Cores on the NVIDIA Ampere Architecture, fast SGEMM, and small matrix classes, bug fixes, and performance enhancements. |
||
|---|---|---|
| .. | ||
| CMakeLists.txt | ||
| matrix.cu | ||
| tensor_nhwc.cu | ||
| tensor.cu | ||
CUTLASS 2.3 adds GEMMs targeting Sparse Tensor Cores on the NVIDIA Ampere Architecture, fast SGEMM, and small matrix classes, bug fixes, and performance enhancements. |
||
|---|---|---|
| .. | ||
| CMakeLists.txt | ||
| matrix.cu | ||
| tensor_nhwc.cu | ||
| tensor.cu | ||