![]() CUTLASS 2.3 adds GEMMs targeting Sparse Tensor Cores on the NVIDIA Ampere Architecture, fast SGEMM, and small matrix classes, bug fixes, and performance enhancements. |
||
---|---|---|
.. | ||
b2b_mma_base.h | ||
b2b_mma_multistage.h | ||
b2b_mma_pipelined.h | ||
default_b2b_mma.h |