vllm/csrc/quantization/marlin
2024-08-02 13:51:58 -07:00
..
dense Support W4A8 quantization for vllm (#5218) 2024-07-31 07:55:21 -06:00
qqq Support W4A8 quantization for vllm (#5218) 2024-07-31 07:55:21 -06:00
sparse [Misc] Disambiguate quantized types via a new ScalarType (#6396) 2024-08-02 13:51:58 -07:00