vllm/engine at 76a5e13270f32216bb28cfe185bada5e88e407d7 - vllm

History

youkaichao 76a5e13270 [core] move parallel sampling out from vllm core (#9302 )		2024-10-22 00:31:44 +00:00
..
multiprocessing	[Frontend] Don't log duplicate error stacktrace for every request in the batch (#9023 )	2024-10-21 14:49:41 -07:00
output_processor	[CI/Build] mypy: Resolve some errors from checking vllm/engine (#9267 )	2024-10-16 22:55:59 +00:00
__init__.py	Change the name to vLLM (#150 )	2023-06-17 03:07:40 -07:00
arg_utils.py	[Doc] update gpu-memory-utilization flag docs (#9507 )	2024-10-19 11:26:36 +08:00
async_llm_engine.py	[Frontend] merge beam search implementations (#9296 )	2024-10-14 15:05:52 -07:00
async_timeout.py	[Bugfix] AsyncLLMEngine hangs with asyncio.run (#5654 )	2024-06-19 13:57:12 -07:00
llm_engine.py	[core] move parallel sampling out from vllm core (#9302 )	2024-10-22 00:31:44 +00:00
metrics_types.py	[MISC] Add lora requests to metrics (#9477 )	2024-10-18 20:50:18 +00:00
metrics.py	[MISC] Add lora requests to metrics (#9477 )	2024-10-18 20:50:18 +00:00
protocol.py	[BugFix] Typing fixes to RequestOutput.prompt and beam search (#9473 )	2024-10-18 07:19:53 +00:00