v3.6: update changelog; update readme; fix typo; fixing typos; hopper gemm with weight prefetch. Co-authored-by: yuzhai <yuzhai@nvidia.com>, Haicheng Wu <haichengw@nvidia.com> | ||
---|---|---|
CMakeLists.txt | ||
sm90_conv1d_wgrad_implicit_gemm_f16_f16_f32_tensorop_f16.cu | ||
sm90_conv1d_wgrad_implicit_gemm_f16_f16_f32_tensorop_f32.cu | ||
sm90_conv2d_wgrad_implicit_gemm_f16_f16_f32_tensorop_f16.cu | ||
sm90_conv2d_wgrad_implicit_gemm_f16_f16_f32_tensorop_f32.cu | ||
sm90_conv3d_wgrad_implicit_gemm_f16_f16_f32_tensorop_f16.cu | ||
sm90_conv3d_wgrad_implicit_gemm_f16_f16_f32_tensorop_f32.cu |