Yujia Zhai
|
cc3c29a81a
|
CUTLASS 3.6.0 (#1850)
* v3.6
* update changelog
* update readme
* fix typo
* fixing typos
* hopper gemm with weight prefetch
---------
Co-authored-by: yuzhai <yuzhai@nvidia.com>
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
|
2024-10-09 15:33:27 -04:00 |
|
Junkai-Wu
|
dbdae514e0
|
Support for TMA Epilogue for Group Gemm and add pingpong ptr array & Group Gemm (#1795)
|
2024-09-11 00:07:31 -04:00 |
|
Vijay Thakkar
|
be60a0b272
|
CUTLASS 3.5.1 (#1623)
* CUTLASS 3.5.1
* updates, optimizations, fixes
|
2024-07-29 08:46:24 -04:00 |
|
ANIKET SHIVAM
|
bbe579a9e3
|
Updates for CUTLASS 3.4.1 (#1346)
* Updates for CUTLASS 3.4.1
* minor epi change
|
2024-02-15 15:48:34 -05:00 |
|
ANIKET SHIVAM
|
751eb9a885
|
Update license year (#1306)
|
2024-01-16 14:37:22 -05:00 |
|
Pradeep Ramani
|
8236f30675
|
CUTLASS 3.4.0 (#1286)
* CUTLASS 3.4.0
* Update CHANGELOG.md
---------
Co-authored-by: Pradeep Ramani <prramani@nvidia.com>
|
2023-12-29 15:21:31 -05:00 |
|