Update Hopper performance plot for CUTLASS 3.1 + CTK 12.1 (#967)
parent 7dbf423763
commit fde824af21
@@ -87,7 +87,7 @@ Starting from CUTLASS 3.0, CUTLASS removed support for the following:
 
 # Performance
 
-<p align="center"><img src=media/images/cutlass-3.0-gemm-peak-performance.png></p>
+<p align="center"><img src=media/images/cutlass-3.1-gemm-peak-performance.png></p>
 
 CUTLASS primitives are very efficient. When used to construct device-wide GEMM kernels,
 they exhibit peak performance comparable to cuBLAS for scalar GEMM
BIN
media/images/cutlass-3.1-gemm-peak-performance.png
Normal file
Binary file not shown.
Size: 163 KiB