vllm/vllm/model_executor/quantization_utils
2023-10-10 19:48:16 -07:00
..
__init__.py Implement AWQ quantization support for LLaMA (#1032) 2023-09-16 00:03:37 -07:00
awq.py workaround of AWQ for Turing GPUs (#1252) 2023-10-10 19:48:16 -07:00
base.py Add minimum capability requirement for AWQ (#1064) 2023-09-18 12:02:01 -07:00