cutlass/examples/08_turing_tensorop_gemm
Andrew Kerr 1ab1027954
Updated mma_sm80.h to avoid perf penalty due to reinterpret_cast<>. (#100)
- Updated mma_sm80.h to avoid perf penalty due to reinterpret_cast<>.
- Enhancement to CUTLASS Utility Library's HostTensorPlanarComplex template to support copy-in and copy-out
- Added test_examples target to build and test all CUTLASS examples
- Minor edits to documentation to point to GTC 2020 webinar
2020-06-15 10:47:01 -07:00
CMakeLists.txt            CUTLASS 2.2 (#96)                                                             2020-06-08 16:17:35 -07:00
turing_tensorop_gemm.cu   Updated mma_sm80.h to avoid perf penalty due to reinterpret_cast<>. (#100)   2020-06-15 10:47:01 -07:00