ANIKET SHIVAM
|
90d3b0fb18
|
CUTLASS 3.2.1 (#1113)
* Updates for 3.2.1 release.
* Minor fix in gemm op profiler for raster order.
* Add scheduler mapping for raster order in the kernels.
|
2023-09-26 17:24:26 -04:00 |
|
ANIKET SHIVAM
|
4575443d44
|
CUTLASS 3.2 (#1024)
* CUTLASS 3.2
|
2023-08-07 20:50:32 -04:00 |
|
Jack Kosaian
|
87349d3496
|
Add grouped b2b GEMM (#970)
|
2023-06-05 17:16:57 -04:00 |
|
ANIKET SHIVAM
|
f079619f5e
|
More updates for 3.1 (#958)
* Updates for 3.1
* Minor change
* doc link fix
* Minor updates
|
2023-05-24 10:17:16 -04:00 |
|
Alexander Pivovarov
|
7e370c9637
|
Fix typos 2 (#842)
Co-authored-by: Haicheng Wu <57973641+hwu36@users.noreply.github.com>
|
2023-03-09 23:22:56 -05:00 |
|
ANIKET SHIVAM
|
66d9cddc83
|
New updates for 2.11 (#775)
* New updates.
* Minor profiler updates
Co-authored-by: Aniket Shivam <ashivam@nvidia.com>
|
2023-01-20 16:32:57 -05:00 |
|
Haicheng Wu
|
497b499d9d
|
Add residual support for shmem staging iterator used in back-to-back GEMM fusion. This allows support of problem_size_0_n that is not multiple of 32. (#590)
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
|
2022-08-15 11:19:24 -04:00 |
|
Haicheng Wu
|
ec2b4fd85d
|
b2b bias vector support (#482)
* b2b bias vector support
* add files
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
|
2022-04-30 04:16:15 -07:00 |
|
Andrew Kerr
|
12f4108ac2
|
CUTLASS 2.9 (#468)
|
2022-04-23 15:02:38 -04:00 |
|
Manish Gupta
|
808c25337a
|
CUTLASS 2.8 (#363)
CUTLASS 2.8
|
2021-11-19 13:26:35 -08:00 |
|
Manish Gupta
|
6c2f8f2fb8
|
CUTLASS 2.6.1 - functional and performance enhancements to strided DGRAD, fixes, and tuning
* cutlass 2.6 update
* remove debug prints
* cutlass 2.6.1 (minor update)
* Updated CHANGELOG.
* Minor edit to readme to indicate patch version.
* Minor edit to readme.
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>, Andrew Kerr <akerr@nvidia.com>
|
2021-09-03 10:26:15 -07:00 |
|
Manish Gupta
|
1ac4559d12
|
Cutlass 2.6 Update 1 (#301)
* cutlass 2.6 update
* remove debug prints
|
2021-07-27 17:58:30 -07:00 |
|
Andrew Kerr
|
0e13748649
|
CUTLASS 2.5
|
2021-02-26 09:58:26 -05:00 |
|