* v3.6 * update changelog * update readme * fix typo * fixing typos * hopper gemm with weight prefetch --------- Co-authored-by: yuzhai <yuzhai@nvidia.com> Co-authored-by: Haicheng Wu <haichengw@nvidia.com> |
||
|---|---|---|
| .. | ||
| 53_hopper_gemm_permute.cu | ||
| CMakeLists.txt | ||
| permute_kernel.cuh | ||
| permute_traits.hpp | ||