vllm/csrc/quantization/fp8
2024-08-16 10:06:30 -07:00
..
amd [Kernel] Squash a few more warnings (#6914) 2024-07-30 13:50:42 -04:00
nvidia [CI/Build] Suppress divide-by-zero and missing return statement warnings (#7001) 2024-08-05 16:00:01 -04:00
common.cu [Feature][Hardware][Amd] Add fp8 Linear Layer for Rocm (#7210) 2024-08-16 10:06:30 -07:00
fp8_marlin.cu [Kernel][Core] Add AWQ support to the Marlin kernel (#6612) 2024-07-21 19:41:42 -04:00