Release 3.3.0: adds support for mixed-precision GEMMs on Hopper and Ampere; adds support for < 16B aligned GEMMs on Hopper; enhancements to EVT, the Python interface, and sub-byte type handling in CuTe; several other bug fixes and performance improvements. Minor doc update. |
||
---|---|---|
52_hopper_gather_scatter_fusion.cu | ||
CMakeLists.txt | ||
gather_gemm.hpp | ||
gather_kernel.cuh | ||
gather_tensor.hpp | ||
scatter_epilogue.hpp |