[NFC] improve doc: fix typo in mma doc (#1417)

This commit is contained in:
Tom Tan 2024-03-27 11:07:20 -07:00 committed by GitHub
parent c4e3e122e2
commit 8f7d2789b8
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194

View File

@ -433,7 +433,7 @@ where we see 16 copies of the 64x8 tile.
### A and B Layout Mapping
GMMA atoms that consume A and B sources directly from shared memory are a bit interesting. The GMMA Descriptor is constructed on an entore tile of A and/or B data in shared memory rather than being partitioned by threads. That is, every thread sees the entire tile of data and the tile is not reordered so that the descriptor can be constructed on it. In `ALayout` form, this can be expressed
GMMA atoms that consume A and B sources directly from shared memory are a bit interesting. The GMMA Descriptor is constructed on an entire tile of A and/or B data in shared memory rather than being partitioned by threads. That is, every thread sees the entire tile of data and the tile is not reordered so that the descriptor can be constructed on it. In `ALayout` form, this can be expressed
```cpp
// (T128,V64x8) -> (M64,K16)