cutlass/include/cutlass/gemm/device
Andrew Kerr c53f3339bb
CUTLASS 2.3 initial commit (#134)
CUTLASS 2.3 adds GEMMs targeting Sparse Tensor Cores on the NVIDIA Ampere Architecture, fast SGEMM, and small matrix classes, bug fixes, and performance enhancements.
2020-09-23 14:00:58 -07:00
..
default_gemm_configuration.h CUTLASS 2.2 (#96) 2020-06-08 16:17:35 -07:00
gemm_array.h CUTLASS 2.2 (#96) 2020-06-08 16:17:35 -07:00
gemm_batched.h CUTLASS 2.2 (#96) 2020-06-08 16:17:35 -07:00
gemm_complex.h CUTLASS 2.2 (#96) 2020-06-08 16:17:35 -07:00
gemm_sparse.h CUTLASS 2.3 initial commit (#134) 2020-09-23 14:00:58 -07:00
gemm_splitk_parallel.h CUTLASS 2.2 (#96) 2020-06-08 16:17:35 -07:00
gemm_universal_adapter.h CUTLASS 2.2 (#96) 2020-06-08 16:17:35 -07:00
gemm_universal_base.h CUTLASS 2.3 initial commit (#134) 2020-09-23 14:00:58 -07:00
gemm_universal.h CUTLASS 2.2 (#96) 2020-06-08 16:17:35 -07:00
gemm.h CUTLASS 2.2 (#96) 2020-06-08 16:17:35 -07:00