streamk paper link (#765)
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
parent eb7f99d3dd
commit 8b42e751c6
@@ -34,7 +34,7 @@
 "classic data-parallel" and "Split-K" decompositions.
 
 For more details regarding the Stream-K method, see "Stream-K: Work-centric Parallel Decomposition
-for Dense Matrix-Matrix Multiplication on the GPU" <todo: link>
+for Dense Matrix-Matrix Multiplication on the GPU" (https://arxiv.org/abs/2301.03598)
 
 Requires NVIDIA Ampere or newer device (SM80+).
 