cutlass/tools/library
Vijay Thakkar e01b9b5029
Shard gemm reference templates into multiple TUs for parallel compilation (#1043)
* Split apart gemm reference templates into multiple TUs for parallel compilation

* remove old files

* better balancing of ref kernels across TUs

* remove 3 new added refcheck kernels and some un-necessary fp8 library instances to reduce lib size

* remove auto fp8 kernels

* remove some redundant kernels
2023-08-30 16:46:30 -04:00
..
include/cutlass/library CUTLASS 3.2 (#1024) 2023-08-07 20:50:32 -04:00
scripts Shard gemm reference templates into multiple TUs for parallel compilation (#1043) 2023-08-30 16:46:30 -04:00
src Shard gemm reference templates into multiple TUs for parallel compilation (#1043) 2023-08-30 16:46:30 -04:00
CMakeLists.txt Shard gemm reference templates into multiple TUs for parallel compilation (#1043) 2023-08-30 16:46:30 -04:00