* Updated documentation of fused GEMM example and removed UNITY BUILD batch size. The default batch size when unity build is enabled tends to be favorable. |
||
|---|---|---|
| .. | ||
| device | ||
| kernel | ||
| threadblock | ||
| b2b_gemm_f16t_f16n_f16t_tensor_op_f16_sm75.h | ||
| b2b_gemm_run.h | ||
| b2b_gemm_s8n_s8t_s8n_tensor_op_s32_sm75.h | ||
| b2b_interleaved_gemm_run.h | ||
| CMakeLists.txt | ||
| fused_gemm.cu | ||