vllm/csrc/quantization/gguf
Isotr0py 360bd67cf0
[Core] Support loading GGUF model (#5191)
Co-authored-by: Michael Goin <michael@neuralmagic.com>
2024-08-05 17:54:23 -06:00
..
dequantize.cuh [Core] Support loading GGUF model (#5191) 2024-08-05 17:54:23 -06:00
ggml-common.h [Core] Support loading GGUF model (#5191) 2024-08-05 17:54:23 -06:00
gguf_kernel.cu [Core] Support loading GGUF model (#5191) 2024-08-05 17:54:23 -06:00
mmq.cuh [Core] Support loading GGUF model (#5191) 2024-08-05 17:54:23 -06:00
mmvq.cuh [Core] Support loading GGUF model (#5191) 2024-08-05 17:54:23 -06:00
vecdotq.cuh [Core] Support loading GGUF model (#5191) 2024-08-05 17:54:23 -06:00