vllm/csrc/quantization/gguf
2024-11-22 21:14:49 -08:00
..
dequantize.cuh [Bugfix][Kernel] Add IQ1_M quantization implementation to GGUF kernel (#8357) 2024-09-15 16:51:44 -06:00
ggml-common.h [AMD] Add support for GGUF quantization on ROCm (#10254) 2024-11-22 21:14:49 -08:00
gguf_kernel.cu [AMD] Add support for GGUF quantization on ROCm (#10254) 2024-11-22 21:14:49 -08:00
mmq.cuh [AMD] Add support for GGUF quantization on ROCm (#10254) 2024-11-22 21:14:49 -08:00
mmvq.cuh [AMD] Add support for GGUF quantization on ROCm (#10254) 2024-11-22 21:14:49 -08:00
vecdotq.cuh [AMD] Add support for GGUF quantization on ROCm (#10254) 2024-11-22 21:14:49 -08:00