Vijay Thakkar
|
be60a0b272
|
CUTLASS 3.5.1 (#1623)
* CUTLASS 3.5.1
* updates, optimizations, fixes
|
2024-07-29 08:46:24 -04:00 |
|
ANIKET SHIVAM
|
751eb9a885
|
Update license year (#1306)
|
2024-01-16 14:37:22 -05:00 |
|
Jee Li
|
c9591a694d
|
fix typo (#1279)
|
2024-01-04 12:41:39 -05:00 |
|
ANIKET SHIVAM
|
90d3b0fb18
|
CUTLASS 3.2.1 (#1113)
* Updates for 3.2.1 release.
* Minor fix in gemm op profiler for raster order.
* Add scheduler mapping for raster order in the kernels.
|
2023-09-26 17:24:26 -04:00 |
|
ANIKET SHIVAM
|
7c04f95415
|
Updates for 3.1 (#932)
|
2023-04-29 09:34:27 -04:00 |
|
ANIKET SHIVAM
|
66d9cddc83
|
New updates for 2.11 (#775)
* New updates.
* Minor profiler updates
Co-authored-by: Aniket Shivam <ashivam@nvidia.com>
|
2023-01-20 16:32:57 -05:00 |
|
ANIKET SHIVAM
|
b72cbf957d
|
CUTLASS 2.10 (#615)
Co-authored-by: Aniket Shivam <ashivam@nvidia.com>
|
2022-09-03 18:48:46 -04:00 |
|
Exusial
|
310ed81ac3
|
fix description in example 12. (#444)
Co-authored-by: Exusial <Exusial>
|
2022-04-24 16:29:06 -04:00 |
|
Andrew Kerr
|
12f4108ac2
|
CUTLASS 2.9 (#468)
|
2022-04-23 15:02:38 -04:00 |
|
Manish Gupta
|
1ac4559d12
|
Cutlass 2.6 Update 1 (#301)
* cutlass 2.6 update
* remove debug prints
|
2021-07-27 17:58:30 -07:00 |
|
Andrew Kerr
|
0e13748649
|
CUTLASS 2.5
|
2021-02-26 09:58:26 -05:00 |
|
Manish Gupta
|
6615010cd0
|
CUTLASS 2.4 (Implicit GEMM convolution) (#147)
CUTLASS 2.4 (Implicit GEMM Convolution)
Co-authored-by: Manish Gupta <manigupta@nvidia.com>, Haicheng Wu <haichengw@nvidia.com>, Dustyn Blasig <dblasig@nvidia.com>, Andrew Kerr <akerr@nvidia.com>
|
2020-11-19 21:25:25 -08:00 |
|
hwu36
|
4dac7490e6
|
Typoes (#107)
* Update splitk_gemm.cu
* Update gemm_bias_relu.cu
* Update mma_sm75.h
|
2020-07-13 14:25:52 -07:00 |
|
Andrew Kerr
|
86931fef85
|
CUTLASS 2.2 (#96)
Adds support for NVIDIA Ampere Architecture features. CUDA 11 Toolkit recommended.
|
2020-06-08 16:17:35 -07:00 |
|