Fix README (#1658)
* Fix README * Improve README --------- Co-authored-by: Haicheng Wu <57973641+hwu36@users.noreply.github.com>
This commit is contained in:
parent
83ae20c740
commit
e5f3caf145
@ -41,7 +41,6 @@ and improves code composability and readability. More documentation specific to
|
|||||||
|
|
||||||
In addition to GEMMs, CUTLASS implements high-performance convolution via the implicit GEMM algorithm. Implicit GEMM is the formulation of a convolution operation as a GEMM thereby taking advantage of CUTLASS's modular GEMM pipeline. This allows CUTLASS to build convolutions by reusing highly-optimized GEMM components.
|
In addition to GEMMs, CUTLASS implements high-performance convolution via the implicit GEMM algorithm. Implicit GEMM is the formulation of a convolution operation as a GEMM thereby taking advantage of CUTLASS's modular GEMM pipeline. This allows CUTLASS to build convolutions by reusing highly-optimized GEMM components.
|
||||||
|
|
||||||
|
|
||||||
# What's New in CUTLASS 3.6
|
# What's New in CUTLASS 3.6
|
||||||
|
|
||||||
CUTLASS 3.6.0 is an update to CUTLASS adding:
|
CUTLASS 3.6.0 is an update to CUTLASS adding:
|
||||||
|
@ -179,11 +179,13 @@ $ make test_unit_gemm_warp -j
|
|||||||
To minimize compilation time, specific GPU architectures can be enabled via the CMake command,
|
To minimize compilation time, specific GPU architectures can be enabled via the CMake command,
|
||||||
selected by [CUDA Compute Capability.](https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#compute-capabilities)
|
selected by [CUDA Compute Capability.](https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#compute-capabilities)
|
||||||
|
|
||||||
**NVIDIA Ampere Architecture.**
|
**NVIDIA Hopper Architecture.**
|
||||||
```bash
|
```bash
|
||||||
$ cmake .. -DCUTLASS_NVCC_ARCHS=90a # compiles for NVIDIA Hopper GPU architecture
|
$ cmake .. -DCUTLASS_NVCC_ARCHS=90a # compiles for NVIDIA Hopper GPU architecture
|
||||||
```
|
```
|
||||||
|
|
||||||
|
**NVIDIA Ampere Architecture.**
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
$ cmake .. -DCUTLASS_NVCC_ARCHS=80 # compiles for NVIDIA Ampere GPU architecture
|
$ cmake .. -DCUTLASS_NVCC_ARCHS=80 # compiles for NVIDIA Ampere GPU architecture
|
||||||
```
|
```
|
||||||
|
Loading…
Reference in New Issue
Block a user