cutlass/tools/profiler/src
Pradeep Ramani c008b4aea8
CUTLASS 3.3.0 (#1167)
* Release 3.3.0

Adds support for mixed precision GEMMs On Hopper and Ampere
Adds support for < 16B aligned GEMMs on Hopper
Enhancements to EVT
Enhancements to Python interface
Enhancements to Sub-byte type handling in CuTe
Several other bug-fixes and performance improvements.

* minor doc update
2023-11-02 11:09:05 -04:00
..
conv2d_operation_profiler.cu CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
conv3d_operation_profiler.cu CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
cublas_helpers.cu CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
cudnn_helpers.cpp CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
cutlass_profiler.cu CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
device_allocation.cu CUTLASS 3.3.0 (#1167) 2023-11-02 11:09:05 -04:00
device_context.cu CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
enumerated_types.cpp CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
gemm_operation_profiler.cu CUTLASS 3.3.0 (#1167) 2023-11-02 11:09:05 -04:00
gpu_timer.cpp CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
main.cpp CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
operation_profiler.cu CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
options.cu CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
performance_report.cpp CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
performance_result.cu CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
problem_space.cpp CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
rank_2k_operation_profiler.cu CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
rank_k_operation_profiler.cu CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
sparse_gemm_operation_profiler.cu CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
symm_operation_profiler.cu CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00
trmm_operation_profiler.cu CUTLASS 3.2.1 (#1113) 2023-09-26 17:24:26 -04:00