cutlass/test/unit/epilogue/threadblock
Andrew Kerr 96dab34ad9
CUTLASS 2.1 (#83)
CUTLASS 2.1 contributes:
- BLAS-style host-side API added to CUTLASS Library
- Planar Complex GEMM kernels targeting Volta and Turing Tensor Cores
- Minor enhancements and bug fixes
2020-04-07 13:51:25 -07:00
..
CMakeLists.txt CUTLASS 2.1 (#83) 2020-04-07 13:51:25 -07:00
epilogue_planar_complex.cu CUTLASS 2.1 (#83) 2020-04-07 13:51:25 -07:00
epilogue_simt_sm60.cu CUTLASS 2.0 (#62) 2019-11-19 16:55:34 -08:00
epilogue_simt_sm61.cu CUTLASS 2.0 (#62) 2019-11-19 16:55:34 -08:00
epilogue_simt.cu CUTLASS 2.0 (#62) 2019-11-19 16:55:34 -08:00
epilogue_tensor_op.cu CUTLASS 2.1 (#83) 2020-04-07 13:51:25 -07:00
epilogue_volta_tensor_op.cu CUTLASS 2.0 (#62) 2019-11-19 16:55:34 -08:00
epilogue_wmma_tensor_op_sm70.cu CUTLASS 2.0 (#62) 2019-11-19 16:55:34 -08:00
output_tile_threadmap.cu CUTLASS 2.0 (#62) 2019-11-19 16:55:34 -08:00
predicated_tile_iterator.cu CUTLASS 2.0 (#62) 2019-11-19 16:55:34 -08:00
testbed_planar_complex.h CUTLASS 2.1 (#83) 2020-04-07 13:51:25 -07:00
testbed.h CUTLASS 2.0 (#62) 2019-11-19 16:55:34 -08:00