From dd4dd4cebfcf8d4f08fd9155e497dfb5d94d25e6 Mon Sep 17 00:00:00 2001
From: Duane Merrill
Date: Tue, 5 Dec 2017 20:58:01 -0500
Subject: [PATCH] Update README.md

---
 README.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 6dc4200e..029c5353 100644
--- a/README.md
+++ b/README.md
@@ -4,8 +4,8 @@
 CUTLASS is a collection of CUDA C++ template abstractions for implementing
 high-performance matrix-multiplication (GEMM) at all levels and scales within CUDA.
-It incorporates the same strategies for hierarchical decomposition and data movement
-that are used to implement cuBLAS. CUTLASS decomposes these “moving parts” into
+It incorporates strategies for hierarchical decomposition and data movement similar
+to those used to implement cuBLAS. CUTLASS decomposes these “moving parts” into
 reusable, modular software components abstracted by C++ template classes. These
 thread-wide, warp-wide, block-wide, and device-wide primitives can be specialized
 and tuned via custom tiling sizes, data types, and other algorithmic policy. The