vllm/csrc/quantization/gguf
2024-09-16 12:15:57 -06:00
..
dequantize.cuh [Bugfix][Kernel] Add IQ1_M quantization implementation to GGUF kernel (#8357) 2024-09-15 16:51:44 -06:00
ggml-common.h [Bugfix][Kernel] Add IQ1_M quantization implementation to GGUF kernel (#8357) 2024-09-15 16:51:44 -06:00
gguf_kernel.cu [Bugfix][Kernel] Add IQ1_M quantization implementation to GGUF kernel (#8357) 2024-09-15 16:51:44 -06:00
mmq.cuh [Core] Support loading GGUF model (#5191) 2024-08-05 17:54:23 -06:00
mmvq.cuh [Bugfix][Kernel] Add IQ1_M quantization implementation to GGUF kernel (#8357) 2024-09-15 16:51:44 -06:00
vecdotq.cuh [Bugfix][Kernel] Fix build for sm_60 in GGUF kernel (#8506) 2024-09-16 12:15:57 -06:00