From 8f7d2789b8b04418ac01e515019dc3b74b249070 Mon Sep 17 00:00:00 2001
From: Tom Tan
Date: Wed, 27 Mar 2024 11:07:20 -0700
Subject: [PATCH] [NFC] improve doc: fix typo in mma doc (#1417)

---
 media/docs/cute/0t_mma_atom.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/media/docs/cute/0t_mma_atom.md b/media/docs/cute/0t_mma_atom.md
index 77578f81..6f285c26 100644
--- a/media/docs/cute/0t_mma_atom.md
+++ b/media/docs/cute/0t_mma_atom.md
@@ -433,7 +433,7 @@ where we see 16 copies of the 64x8 tile.
 
 ### A and B Layout Mapping
 
-GMMA atoms that consume A and B sources directly from shared memory are a bit interesting. The GMMA Descriptor is constructed on an entore tile of A and/or B data in shared memory rather than being partitioned by threads. That is, every thread sees the entire tile of data and the tile is not reordered so that the descriptor can be constructed on it. In `ALayout` form, this can be expressed
+GMMA atoms that consume A and B sources directly from shared memory are a bit interesting. The GMMA Descriptor is constructed on an entire tile of A and/or B data in shared memory rather than being partitioned by threads. That is, every thread sees the entire tile of data and the tile is not reordered so that the descriptor can be constructed on it. In `ALayout` form, this can be expressed
 
 ```cpp
 // (T128,V64x8) -> (M64,K16)