From 87070b6d51ea9269bb912f1624087c01b1f2ec6b Mon Sep 17 00:00:00 2001 From: Yujia Zhai Date: Tue, 28 Mar 2023 14:06:57 -0700 Subject: [PATCH] add a CUTLASS publication (#893) * add bytetransformer * update arxiv link * re-order --- PUBLICATIONS.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/PUBLICATIONS.md b/PUBLICATIONS.md index e8959f8f..acbd59cc 100644 --- a/PUBLICATIONS.md +++ b/PUBLICATIONS.md @@ -2,6 +2,8 @@ ## 2023 +- ["ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs"](https://arxiv.org/abs/2210.03052). Yujia Zhai, Chengquan Jiang, Leyuan Wang, Xiaoying Jia, Shang Zhang, Zizhong Chen, Xin Liu, Yibo Zhu. _Proceedings of the 37th IEEE International Parallel & Distributed Processing Symposium_, May 2023. + - ["Stream-K: Work-centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU"](https://arxiv.org/abs/2301.03598). Muhammad Osama, Duane Merrill, Cris Cecka, Michael Garland, John D. Owens. _arXiv_, January 2023. ## 2022