Cutlass
CUDA Templates for Linear Algebra Subroutines and Solvers
|
The shared memory to swizzle the data in the epilogue.
#include <gemm_epilogue_traits.h>
Public Attributes | |
StreamSharedStorage | shared_stream |
StreamSharedStorage cutlass::gemm::GemmEpilogueTraits< OutputTile_, Accumulators_, GlobalLoadIteratorC_, GlobalTransformerC_, GlobalTransformerD_, GlobalStoreIteratorD_, SharedStoreIteratorD_, SharedStoreTransformerD_, SharedLoadIteratorD_, Iterations_, Delta_, Functor_, Index_ >::SharedStorage::shared_stream |