vllm/csrc/quantization/fp8
Luka Govedič 7937009a7e
[Kernel] Replaced blockReduce[...] functions with cub::BlockReduce (#7233)
Co-authored-by: Michael Goin <michael@neuralmagic.com>
2024-08-21 20:18:00 -04:00
..
amd [Kernel] Squash a few more warnings (#6914) 2024-07-30 13:50:42 -04:00
nvidia [CI/Build] Suppress divide-by-zero and missing return statement warnings (#7001) 2024-08-05 16:00:01 -04:00
common.cu [Kernel] Replaced blockReduce[...] functions with cub::BlockReduce (#7233) 2024-08-21 20:18:00 -04:00
fp8_marlin.cu [Kernel][Core] Add AWQ support to the Marlin kernel (#6612) 2024-07-21 19:41:42 -04:00