cutlass/include/cutlass
reed 19f3cc33f1
Fix uint128 operator add (#1400)
* fix uint128 operator add for 64-bit hilo implementation

* add uint128 test for operator add

* make clang happy

---------

Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
2024-04-02 13:32:18 -04:00
arch CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
conv CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
detail group gemm set stride L = cute::Int<0> (#1416) 2024-03-20 17:31:14 -04:00
epilogue CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
gemm CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
layout CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
pipeline CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
platform Update license year (#1306) 2024-01-16 14:37:22 -05:00
reduction CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
thread Update license year (#1306) 2024-01-16 14:37:22 -05:00
transform CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
aligned_buffer.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
array_planar_complex.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
array_subbyte.h CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
array.h CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
barrier.h CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
bfloat16.h Add a missing platform include (#1328) 2024-02-03 01:30:32 -05:00
blas3_types.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
blas3.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
block_striped.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
cluster_launch.hpp CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
complex.h CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
constants.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
coord.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
core_io.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
cuda_host_adapter.hpp CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
cutlass.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
device_kernel.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
fast_math.h CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
float8.h CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
floating_point_nvrtc.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
functional.h CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
gemm_coord.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
gemm_coord.hpp Update license year (#1306) 2024-01-16 14:37:22 -05:00
half.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
integer_subbyte.h CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
kernel_hardware_info.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
kernel_hardware_info.hpp Update license year (#1306) 2024-01-16 14:37:22 -05:00
kernel_launch.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
matrix_coord.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
matrix_shape.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
matrix.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
numeric_conversion.h CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
numeric_size.h CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
numeric_types.h CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
pitch_linear_coord.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
predicate_vector.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
quaternion.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
real.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
relatively_equal.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
semaphore.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
subbyte_reference.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
tensor_coord.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
tensor_ref_planar_complex.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
tensor_ref.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
tensor_view_planar_complex.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
tensor_view.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
tfloat32.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
trace.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
uint128.h Fix uint128 operator add (#1400) 2024-04-02 13:32:18 -04:00
version.h CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
wmma_array.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
workspace.h CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00