From eb7f99d3dda9d5969f88dcc3d8b3e5dac59920e9 Mon Sep 17 00:00:00 2001 From: Muhammad Osama Date: Tue, 10 Jan 2023 17:39:06 -0800 Subject: [PATCH] @hwu36 Adding the individual arXiv link for Stream-K paper. (#764) * Stream-K individual paper entry. * arXiv links updated. --- PUBLICATIONS.md | 6 +++++- 1 file changed, 5 insertions(+), 1 deletion(-) diff --git a/PUBLICATIONS.md b/PUBLICATIONS.md index 7bff5a60..a2b2d90a 100644 --- a/PUBLICATIONS.md +++ b/PUBLICATIONS.md @@ -1,8 +1,12 @@ # Publications Using Cutlass +## 2023 + +- ["Stream-K: Work-centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU"](https://arxiv.org/abs/2301.03598). Muhammad Osama, Duane Merrill, Cris Cecka, Michael Garland, John D. Owens. _arXiv_, January 2023. + ## 2022 -- ["GPU Load Balancing"](https://arxiv.org/pdf/2212.08964.pdf). Muhammad Osama. _Doctoral dissertation, University of California, Davis_, December 2022. +- ["GPU Load Balancing"](https://arxiv.org/abs/2212.08964). Muhammad Osama. _Doctoral dissertation, University of California, Davis_, December 2022. - ["Bolt: Bridging the Gap between Auto-tuners and Hardware-native Performance"](https://arxiv.org/abs/2110.15238). Jiarong Xing, Leyuan Wang, Shang Zhang, Jack Chen, Ang Chen, Yibo Zhu. _Proceedings of the 5th MLSys Conference_, August 2022.