cutlass/tools/library/src
Vijay Thakkar e01b9b5029
Shard gemm reference templates into multiple TUs for parallel compilation (#1043)
* Split apart gemm reference templates into multiple TUs for parallel compilation

* remove old files

* better balancing of ref kernels across TUs

* remove 3 new added refcheck kernels and some un-necessary fp8 library instances to reduce lib size

* remove auto fp8 kernels

* remove some redundant kernels
2023-08-30 16:46:30 -04:00
..
reduction CUTLASS 3.1 (#915) 2023-04-14 23:19:34 -04:00
reference Shard gemm reference templates into multiple TUs for parallel compilation (#1043) 2023-08-30 16:46:30 -04:00
conv2d_operation.h Fix typos 2 (#842) 2023-03-09 23:22:56 -05:00
conv3d_operation.h Fix typos 2 (#842) 2023-03-09 23:22:56 -05:00
gemm_operation_3x.hpp CUTLASS 3.2 (#1024) 2023-08-07 20:50:32 -04:00
gemm_operation.h CUTLASS 3.1 (#915) 2023-04-14 23:19:34 -04:00
handle.cu CUTLASS 3.1 (#915) 2023-04-14 23:19:34 -04:00
library_internal.h Updates for 3.1 (#932) 2023-04-29 09:34:27 -04:00
manifest.cpp CUTLASS 3.1 (#915) 2023-04-14 23:19:34 -04:00
operation_table.cu CUTLASS 3.1 (#915) 2023-04-14 23:19:34 -04:00
rank_2k_operation.h Fix typos 2 (#842) 2023-03-09 23:22:56 -05:00
rank_k_operation.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
singleton.cu New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
symm_operation.h Fix typos 2 (#842) 2023-03-09 23:22:56 -05:00
trmm_operation.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
util.cu CUTLASS 3.1 (#915) 2023-04-14 23:19:34 -04:00