vllm/csrc/quantization/marlin/dense
2024-07-21 19:41:42 -04:00
..
LICENSE Add GPTQ Marlin 2:4 sparse structured support (#4790) 2024-05-16 12:56:15 -04:00
marlin_cuda_kernel.cu [Kernel][Core] Add AWQ support to the Marlin kernel (#6612) 2024-07-21 19:41:42 -04:00