cutlass/examples/52_hopper_gather_scatter_fusion
Pradeep Ramani c008b4aea8
CUTLASS 3.3.0 (#1167)
* Release 3.3.0

Adds support for mixed precision GEMMs on Hopper and Ampere
Adds support for < 16B aligned GEMMs on Hopper
Enhancements to EVT
Enhancements to Python interface
Enhancements to Sub-byte type handling in CuTe
Several other bug fixes and performance improvements

* minor doc update
2023-11-02 11:09:05 -04:00
File                                Last commit            Date
52_hopper_gather_scatter_fusion.cu  CUTLASS 3.3.0 (#1167)  2023-11-02 11:09:05 -04:00
CMakeLists.txt                      CUTLASS 3.2 (#1024)    2023-08-07 20:50:32 -04:00
gather_gemm.hpp                     CUTLASS 3.3.0 (#1167)  2023-11-02 11:09:05 -04:00
gather_kernel.cuh                   CUTLASS 3.2 (#1024)    2023-08-07 20:50:32 -04:00
gather_tensor.hpp                   CUTLASS 3.2 (#1024)    2023-08-07 20:50:32 -04:00
scatter_epilogue.hpp                CUTLASS 3.2 (#1024)    2023-08-07 20:50:32 -04:00