vllm/csrc/quantization/fp8
2024-07-21 19:41:42 -04:00
..
amd [CI/Build] Enforce style for C++ and CUDA code with clang-format (#4722) 2024-05-22 07:18:41 +00:00
nvidia [CI/Build] Enforce style for C++ and CUDA code with clang-format (#4722) 2024-05-22 07:18:41 +00:00
common.cu [ Kernel ] FP8 Dynamic Per Token Quant - Add scale_ub (#6593) 2024-07-19 18:15:26 -07:00
fp8_marlin.cu [Kernel][Core] Add AWQ support to the Marlin kernel (#6612) 2024-07-21 19:41:42 -04:00