cutlass/include/cutlass/arch
Gregory Meyer (gregjm) ecbd24566c
Enable shared memory intrinsics and ldmatrix PTX on Clang. (#754)

This commit adds preprocessor checks that enable the shared memory
intrinsics `__cvta_generic_to_shared` and `__nvvm_get_smem_pointer`, as
well as the `ldmatrix` PTX instructions, when compiling with Clang.
Without them, Clang builds suffer a significant latency regression.

* Refine the macro.
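
A minimal sketch of the kind of preprocessor check the commit message describes. It is not the code from `memory_sm75.h`: every `SKETCH_*` macro and the `sketch_smem_pointer` function are hypothetical names chosen for illustration, and the exact version conditions are assumptions. The idea is that device-side compilation under either NVCC (CUDA 11 or newer assumed) or Clang takes the intrinsic path, other device builds fall back to inline PTX, and host builds get a placeholder.

```cpp
#include <cstdint>

// All SKETCH_* names are hypothetical; they are not CUTLASS macros.

// __CUDA_ARCH__ is defined only while compiling device code.
#if defined(__CUDA_ARCH__)
  #define SKETCH_DEVICE_PASS 1
#else
  #define SKETCH_DEVICE_PASS 0
#endif

// Assumption: the intrinsic is usable on NVCC from CUDA 11 onward and on Clang.
#if SKETCH_DEVICE_PASS && \
    ((defined(__CUDACC_VER_MAJOR__) && __CUDACC_VER_MAJOR__ >= 11) || defined(__clang__))
  #define SKETCH_USE_CVTA_INTRINSIC 1
#else
  #define SKETCH_USE_CVTA_INTRINSIC 0
#endif

// Only annotate the function when a CUDA compiler is actually in use.
#if defined(__CUDACC__) || defined(__CUDA__)
  #define SKETCH_HOST_DEVICE __host__ __device__
#else
  #define SKETCH_HOST_DEVICE
#endif

// Convert a generic pointer into a 32-bit shared-memory address.
SKETCH_HOST_DEVICE inline unsigned sketch_smem_pointer(void const* ptr) {
#if SKETCH_USE_CVTA_INTRINSIC
  // Preferred path: let the compiler intrinsic do the address-space conversion.
  return static_cast<unsigned>(__cvta_generic_to_shared(ptr));
#elif SKETCH_DEVICE_PASS
  // Fallback path: the same conversion written as inline PTX.
  unsigned addr;
  asm volatile(
      "{ .reg .u64 p; cvta.to.shared.u64 p, %1; cvt.u32.u64 %0, p; }\n"
      : "=r"(addr)
      : "l"(ptr));
  return addr;
#else
  // Host code has no shared-memory address space; return a placeholder.
  (void)ptr;
  return 0u;
#endif
}
```

Keeping the compiler test in a single macro means call sites only ever see `sketch_smem_pointer`, so widening or narrowing the set of supported compilers later touches one condition rather than every use.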

---------

Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
2023-03-31 21:42:24 -04:00
arch.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
barrier.h CUTLASS 3.0.0 (#786) 2023-01-23 20:55:28 -05:00
cache_operation.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
memory_sm75.h Enable shared memory intrinsics and ldmatrix PTX on Clang. (#754) 2023-03-31 21:42:24 -04:00
memory_sm80.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
memory.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
mma_sm50.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
mma_sm60.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
mma_sm61.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
mma_sm70.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
mma_sm75.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
mma_sm80.h CUTLASS 3.0.0 (#786) 2023-01-23 20:55:28 -05:00
mma_sm90.h Add missing comma in cutlass/arch/mma_sm90.h (#862) 2023-03-14 12:04:28 -04:00
mma_sparse_sm80.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
mma.h Fix typos 2 (#842) 2023-03-09 23:22:56 -05:00
reg_reconfig.h CUTLASS 3.0.0 (#786) 2023-01-23 20:55:28 -05:00
simd_sm60.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
simd_sm61.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
simd.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
wmma_sm70.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
wmma_sm72.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
wmma_sm75.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00
wmma.h New updates for 2.11 (#775) 2023-01-20 16:32:57 -05:00