add a CUTLASS publication (#893)
* add bytetransformer * update arxiv link * re-order
This commit is contained in:
parent
77549ae6c8
commit
87070b6d51
@ -2,6 +2,8 @@
|
|||||||
|
|
||||||
## 2023
|
## 2023
|
||||||
|
|
||||||
|
- ["ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs"](https://arxiv.org/abs/2210.03052). Yujia Zhai, Chengquan Jiang, Leyuan Wang, Xiaoying Jia, Shang Zhang, Zizhong Chen, Xin Liu, Yibo Zhu. _Proceedings of the 37th IEEE International Parallel & Distributed Processing Symposium_, May 2023.
|
||||||
|
|
||||||
- ["Stream-K: Work-centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU"](https://arxiv.org/abs/2301.03598). Muhammad Osama, Duane Merrill, Cris Cecka, Michael Garland, John D. Owens. _arXiv_, January 2023.
|
- ["Stream-K: Work-centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU"](https://arxiv.org/abs/2301.03598). Muhammad Osama, Duane Merrill, Cris Cecka, Michael Garland, John D. Owens. _arXiv_, January 2023.
|
||||||
|
|
||||||
## 2022
|
## 2022
|
||||||
|
Loading…
Reference in New Issue
Block a user