diff --git a/CHANGELOG.md b/CHANGELOG.md index d7a57e30..d83d451d 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -18,7 +18,7 @@ * Optimal performance using [**CUDA 11.5**](https://developer.nvidia.com/cuda-downloads) * Updates from the community (thanks!) -* **Deprecation announcement:** CUTLASS plans to deprecate the following platforms in the future. Let us know if this affects your use case. +* **Deprecation announcement:** CUTLASS plans to deprecate the following: * Maxwell and Pascal GPU architectures * Ubuntu 16.04 * CUDA 10.2 diff --git a/CONTRIBUTORS.md b/CONTRIBUTORS.md index 5ce83dfb..861f4460 100644 --- a/CONTRIBUTORS.md +++ b/CONTRIBUTORS.md @@ -9,7 +9,7 @@ This is the official list of CUTLASS developers and contributors. ## DEVELOPERS Andrew Kerr Haicheng Wu -Manish Gupta +Manish Gupta Dustyn Blasig Pradeep Ramani Naila Farooqui @@ -24,6 +24,9 @@ Markus Hohnerbach Aditya Atluri David Tanner Manikandan Ananth + +## CUTLASS Product Manager +Matthew Nicely ## CONTRIBUTORS Timothy Costa @@ -56,7 +59,6 @@ Olivier Giroux Stephen Jones Rishkul Kulkarni Bryce Lelbach -Matthew Nicely Joel McCormack Kyrylo Perelygin diff --git a/README.md b/README.md index 36eb9511..1f1bef8e 100644 --- a/README.md +++ b/README.md @@ -41,7 +41,7 @@ CUTLASS 2.8 is an update to CUTLASS adding: - [Grouped GEMM:](/examples/24_gemm_grouped) similar to batched GEMM with distinct problem size per group - [Implicit GEMM Convolution fusion](/examples/13_two_tensor_op_fusion/) supports staging 1st convolution's output accumulator in the shared memory on Turing. - Optimal performance using [CUDA 11.5](https://developer.nvidia.com/cuda-downloads) -- CUTLASS plans to **deprecate** the following platforms in the future. Let us know if this affects your use case. +- **Deprecation announcement:** CUTLASS plans to deprecate the following: - Maxwell and Pascal GPU architectures - Ubuntu 16.04 - CUDA 10.2