vllm/cutlass_w8a8 at 8674f9880e2d8574c2adc759027e0f27dc9b95de - vllm

History

Tyler Michael Smith 8674f9880e [Kernel] Fixup for CUTLASS kernels in CUDA graphs (#4954 ) Pass the CUDA stream into the CUTLASS GEMMs, to avoid future issues with CUDA graphs		2024-05-22 14:10:43 +00:00
..
common.hpp	[Kernel] Add w8a8 CUTLASS kernels (#4749 )	2024-05-16 18:32:50 -04:00
cutlass_visitor_2x_broadcast_epilogue.hpp	[Kernel] Add w8a8 CUTLASS kernels (#4749 )	2024-05-16 18:32:50 -04:00
scaled_mm_dq_c2x.cu	[Kernel] Fixup for CUTLASS kernels in CUDA graphs (#4954 )	2024-05-22 14:10:43 +00:00
scaled_mm_dq_c3x.cu	[Kernel] Fixup for CUTLASS kernels in CUDA graphs (#4954 )	2024-05-22 14:10:43 +00:00
scaled_mm_dq_entry.cu	[CI/Build] Enforce style for C++ and CUDA code with `clang-format` (#4722 )	2024-05-22 07:18:41 +00:00