added support of b2b bmm; fixed arguments and params structures; added batch_count argument; removed SplitKSerial and added new test case with b2b bmm; fixed support of Kbatched and added new test case with batch stride; added batch support for bias and scale; make test; small changes. Co-authored-by: Haicheng Wu <haichengw@nvidia.com> |
||
---|---|---|
b2b_gemm.h | ||
b2b_implicit_gemm_convolution.h | ||
default_b2b_conv2d_fprop_sm75.h | ||
default_b2b_conv2d_fprop_sm80.h | ||
default_b2b_conv2d_fprop_smem_accumulator_sm75.h | ||
default_b2b_conv2d_fprop_smem_accumulator_sm80.h | ||
default_b2b_conv2d_fprop.h | ||
default_b2b_gemm_smem_accumulator.h | ||
default_b2b_gemm.h |