vllm/quantization_utils at 82857368400bcf6a12a3d42a3ccdc5f585153404 - vllm

twaka 8285736840 workaround of AWQ for Turing GPUs (#1252)	2023-10-10 19:48:16 -07:00
__init__.py	Implement AWQ quantization support for LLaMA (#1032)	2023-09-16 00:03:37 -07:00
awq.py	workaround of AWQ for Turing GPUs (#1252)	2023-10-10 19:48:16 -07:00
base.py	Add minimum capability requirement for AWQ (#1064)	2023-09-18 12:02:01 -07:00