* Release 3.3.0 Adds support for mixed precision GEMMs On Hopper and Ampere Adds support for < 16B aligned GEMMs on Hopper Enhancements to EVT Enhancements to Python interface Enhancements to Sub-byte type handling in CuTe Several other bug-fixes and performance improvements. * minor doc update |
||
|---|---|---|
| .. | ||
| bulk_load.cu | ||
| bulk_store.cu | ||
| CMakeLists.txt | ||
| stsm.cu | ||
| tma_load_testbed.hpp | ||
| tma_load.cu | ||
| tma_store_testbed.hpp | ||
| tma_store.cu | ||