cutlass/tools/library/include/cutlass/library
Pradeep Ramani c008b4aea8
CUTLASS 3.3.0 (#1167)
* Release 3.3.0

Adds support for mixed precision GEMMs On Hopper and Ampere
Adds support for < 16B aligned GEMMs on Hopper
Enhancements to EVT
Enhancements to Python interface
Enhancements to Sub-byte type handling in CuTe
Several other bug-fixes and performance improvements.

* minor doc update
2023-11-02 11:09:05 -04:00
..
arch_mappings.h CUTLASS 3.1 (#915) 2023-04-14 23:19:34 -04:00
descriptions.h CUTLASS 3.2 (#1024) 2023-08-07 20:50:32 -04:00
handle.h CUTLASS 3.1 (#915) 2023-04-14 23:19:34 -04:00
library.h CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
manifest.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
operation_table.h CUTLASS 3.3.0 (#1167) 2023-11-02 11:09:05 -04:00
singleton.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
types.h Adding more Threadblock Tiles for Mixed-input TensorOp (BF16 * S8) in cutlass_library (#1132) 2023-10-13 11:33:15 -04:00
util.h CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00