| Author | Commit | Message | Date |
|---|---|---|---|
| Yujia Zhai | cc3c29a81a | CUTLASS 3.6.0 (#1850): v3.6; update changelog; update readme; fix typo; fixing typos; hopper gemm with weight prefetch. Co-authored-by: yuzhai <yuzhai@nvidia.com>. Co-authored-by: Haicheng Wu <haichengw@nvidia.com> | 2024-10-09 15:33:27 -04:00 |
| Vijay Thakkar | be60a0b272 | CUTLASS 3.5.1 (#1623): CUTLASS 3.5.1; updates, optimizations, fixes | 2024-07-29 08:46:24 -04:00 |
| Vijay Thakkar | 629f4653c3 | CUTLASS 3.5.0 (#1411) | 2024-03-19 17:51:04 -04:00 |
| ANIKET SHIVAM | bbe579a9e3 | Updates for CUTLASS 3.4.1 (#1346): Updates for CUTLASS 3.4.1; minor epi change | 2024-02-15 15:48:34 -05:00 |
| ANIKET SHIVAM | 751eb9a885 | Update license year (#1306) | 2024-01-16 14:37:22 -05:00 |
| ANIKET SHIVAM | 2f589ffa76 | Updates for 3.4 release. (#1305) | 2024-01-16 13:42:51 -05:00 |
| Pradeep Ramani | 8236f30675 | CUTLASS 3.4.0 (#1286): CUTLASS 3.4.0; Update CHANGELOG.md. Co-authored-by: Pradeep Ramani <prramani@nvidia.com> | 2023-12-29 15:21:31 -05:00 |
| ANIKET SHIVAM | 90d3b0fb18 | CUTLASS 3.2.1 (#1113): Updates for 3.2.1 release. Minor fix in gemm op profiler for raster order. Add scheduler mapping for raster order in the kernels. | 2023-09-26 17:24:26 -04:00 |
| ANIKET SHIVAM | 4575443d44 | CUTLASS 3.2 (#1024): CUTLASS 3.2 | 2023-08-07 20:50:32 -04:00 |
| Jakub Szuppe | 180c5629bf | Add missing checks for NVRTC in CuTe (#921) | 2023-04-25 12:52:43 -04:00 |
| ANIKET SHIVAM | d572cc1aab | CUTLASS 3.1 (#915). Co-authored-by: Aniket Shivam <ashivam@nvidia.com> | 2023-04-14 23:19:34 -04:00 |
| Vijay Thakkar | 277bd6e537 | CUTLASS 3.0.0 (#786): CUTLASS 3.0.0 | 2023-01-23 20:55:28 -05:00 |