|  4a68cf748e * added support of b2b bmm * fixed arguments and params structures * added batch_count argument * removed SplitKSerial and added new test case with b2b bmm * fixed support of Kbatched and added new test case with batch stride * added batch support for bias and scale * make test * small changes --------- Co-authored-by: Haicheng Wu <haichengw@nvidia.com> | ||
|---|---|---|
| .. | ||
| b2b_gemm.h | ||
| b2b_implicit_gemm_convolution.h | ||
| default_b2b_conv2d_fprop_sm75.h | ||
| default_b2b_conv2d_fprop_sm80.h | ||
| default_b2b_conv2d_fprop_smem_accumulator_sm75.h | ||
| default_b2b_conv2d_fprop_smem_accumulator_sm80.h | ||
| default_b2b_conv2d_fprop.h | ||
| default_b2b_gemm_smem_accumulator.h | ||
| default_b2b_gemm.h | ||