* v3.6 * update changelog * update readme * fix typo * fixing typos * hopper gemm with weight prefetch --------- Co-authored-by: yuzhai <yuzhai@nvidia.com> Co-authored-by: Haicheng Wu <haichengw@nvidia.com> |
||
|---|---|---|
| .. | ||
| 52_hopper_gather_scatter_fusion.cu | ||
| CMakeLists.txt | ||
| gather_gemm.hpp | ||
| gather_kernel.cuh | ||
| scatter_epilogue.hpp | ||