cutlass/include/cute/arch
Christian Sigg e1483d5fa0
Collection of changes to fix clang build. (#1200)
* Remove unused variables

* Qualify calls to make_fragment_? from templated base class.

Fixes clang build error.

* Add missing `#include <cstdio>`

* Various changes to fix clang compile errors.

* More changes to fix clang build.

Remaining issues:

- `params` initializer of `CollectiveEpilogue`.
- `ops` initializer of `Sm90VisitorImplBase`.
- `__usAtomicCAS` needs to be added to clang upstream.

* Fix remaining clang build issues.

* Qualify `cute::rank()` calls.

* Qualify some more calls that are otherwise ambiguous between `cute` and `std` namespace.

* Double-escape special registers in inline asm.

* small change

---------

Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
2023-12-08 14:42:12 -05:00
..
cluster_sm90.hpp Collection of changes to fix clang build. (#1200) 2023-12-08 14:42:12 -05:00
copy_sm75.hpp CUTLASS 3.1 (#915) 2023-04-14 23:19:34 -04:00
copy_sm80.hpp fix cp.async L2 prefetch typo (#1187) 2023-11-28 16:58:04 -05:00
copy_sm90_desc.hpp CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
copy_sm90_tma.hpp CUTLASS 3.3.0 (#1167) 2023-11-02 11:09:05 -04:00
copy_sm90.hpp CUTLASS 3.0.0 (#786) 2023-01-23 20:55:28 -05:00
copy.hpp Updates and Bug fixes to CUTLASS 3.3 (#1232) 2023-12-05 09:50:49 -05:00
mma_sm61.hpp CUTLASS 3.0.0 (#786) 2023-01-23 20:55:28 -05:00
mma_sm70.hpp CUTLASS 3.0.0 (#786) 2023-01-23 20:55:28 -05:00
mma_sm75.hpp CUTLASS 3.0.0 (#786) 2023-01-23 20:55:28 -05:00
mma_sm80.hpp CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
mma_sm90_desc.hpp CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
mma_sm90_gmma.hpp CUTLASS 3.2 (#1024) 2023-08-07 20:50:32 -04:00
mma_sm90.hpp CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
mma.hpp CUTLASS 3.0.0 (#786) 2023-01-23 20:55:28 -05:00
util.hpp Fix host compilation of cute::cast_smem_ptr_to_uint. (#940) 2023-05-10 00:06:54 -04:00