Updated GEMM performance plot with CUTLASS 2.8 compiled using CUDA 11.5 Toolkit.
GPUs under test:
NVIDIA A100
NVIDIA A2
NVIDIA TitanV
NVIDIA GeForce 2080 Ti
124 KiB
2288x1203px
124 KiB
2288x1203px