Update README.md

This commit is contained in:
Duane Merrill 2017-12-05 22:58:12 -05:00 committed by GitHub
parent 537a4bcedf
commit 6cb88d53eb
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23

View File

@ -25,7 +25,7 @@ in CUDA C++"](https://devblogs.nvidia.com/parallelforall/cutlass-linear-algebra-
# Performance
![ALT](/media/cutlass-performance-plot.png "Relative performance of CUTLASS and cuBLAS for large matrices")
<p align="center"><img src=/media/cutlass-performance-plot.png></p>
CUTLASS primitives are very efficient. When used to construct device-wide GEMM kernels,
they exhibit performance comparable to cuBLAS for scalar GEMM