* Actually use float accumulation in gemm_f16t_f16t_f16t_wmma_tensor_op_f32_sm70.cu

  As title.

* Update gemm_f16t_f16t_f16t_wmma_tensor_op_f32_sm70.cu

  Apply the same change to the instance that was missed.

Co-authored-by: Haicheng Wu <57973641+hwu36@users.noreply.github.com>
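For context, a minimal sketch of what "float accumulation" means here, assuming the test instantiates `cutlass::gemm::device::Gemm` the way other SM70 WMMA device tests do; the alias name is hypothetical and the real file's layouts and tile shapes may differ. The accumulator element type is the template argument after the C operand, and the `_f32_` in the file name says it should be `float`, not `cutlass::half_t`:

```cpp
#include "cutlass/gemm/device/gemm.h"

// Hypothetical instantiation mirroring the fix: the "f16t" operands are
// half-precision row-major, but accumulation happens in float (f32),
// matching the _f32_ suffix in the test file name.
using GemmF16F32Acc = cutlass::gemm::device::Gemm<
    cutlass::half_t, cutlass::layout::RowMajor,  // ElementA / LayoutA
    cutlass::half_t, cutlass::layout::RowMajor,  // ElementB / LayoutB
    cutlass::half_t, cutlass::layout::RowMajor,  // ElementC / LayoutC
    float,                                       // ElementAccumulator: the fix
    cutlass::arch::OpClassWmmaTensorOp,          // WMMA tensor-op code path
    cutlass::arch::Sm70>;                        // Volta architecture
```

Compiling this requires nvcc targeting sm_70 or newer with CUTLASS's WMMA support enabled. With `cutlass::half_t` in the accumulator slot, the test would silently exercise f16 accumulation and no longer match the numeric behavior its name advertises.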