ANIKET SHIVAM
|
90d3b0fb18
|
CUTLASS 3.2.1 (#1113)
* Updates for 3.2.1 release.
* Minor fix in gemm op profiler for raster order.
* Add scheduler mapping for raster order in the kernels.
|
2023-09-26 17:24:26 -04:00 |
|
Vadim Markovtsev
|
8783c41851
|
Replace 0x1f with 0xffffffff in __shfl_sync (#1097)
This fixes compatibility with H100 and resolves #1094
|
2023-09-18 19:58:19 -04:00 |
|
Alexander Pivovarov
|
7e370c9637
|
Fix typos 2 (#842)
Co-authored-by: Haicheng Wu <57973641+hwu36@users.noreply.github.com>
|
2023-03-09 23:22:56 -05:00 |
|
ANIKET SHIVAM
|
66d9cddc83
|
New updates for 2.11 (#775)
* New updates.
* Minor profiler updates
Co-authored-by: Aniket Shivam <ashivam@nvidia.com>
|
2023-01-20 16:32:57 -05:00 |
|
Aditya Atluri
|
c975e2ccbb
|
release 2.11 (#703)
|
2022-11-19 09:02:15 -05:00 |
|