vllm/vllm at 2acd76f346efcdff4f6ca1d92fe1575c448e4b70 - vllm

Woosuk Kwon 2acd76f346 [ROCm] Temporarily remove GPTQ ROCm support (#2138)		2023-12-15 17:13:58 -08:00
core	[FIX] Fix formatting error	2023-11-29 00:40:19 +00:00
engine	Add GPTQ support (#916)	2023-12-15 03:04:22 -08:00
entrypoints	Add GPTQ support (#916)	2023-12-15 03:04:22 -08:00
model_executor	Add GPTQ support (#916)	2023-12-15 03:04:22 -08:00
transformers_utils	Fix Baichuan tokenizer error (#1874)	2023-11-30 18:35:50 -08:00
worker	[BugFix] Fix input positions for long context with sliding window (#2088)	2023-12-13 12:28:13 -08:00
__init__.py	Bump up to v0.2.5 (#2095)	2023-12-13 23:56:15 -08:00
block.py	[Quality] Add code formatter and linter (#326)	2023-07-03 11:31:55 -07:00
config.py	[ROCm] Temporarily remove GPTQ ROCm support (#2138)	2023-12-15 17:13:58 -08:00
logger.py	[Fix] Fix duplicated logging messages (#1524)	2023-10-31 09:04:47 -07:00
outputs.py	docs: add description (#1553)	2023-11-03 09:14:52 -07:00
py.typed	Add py.typed so consumers of vLLM can get type checking (#1509)	2023-10-30 14:50:47 -07:00
sampling_params.py	Add a flag to include stop string in output text (#1976)	2023-12-15 00:45:58 -08:00
sequence.py	[FIX] Fix class naming (#1803)	2023-11-28 14:08:01 -08:00
utils.py	Fix peak memory profiling (#2031)	2023-12-12 22:01:53 -08:00