vllm/vllm/engine
2024-10-22 00:31:44 +00:00
..
multiprocessing [Frontend] Don't log duplicate error stacktrace for every request in the batch (#9023) 2024-10-21 14:49:41 -07:00
output_processor [CI/Build] mypy: Resolve some errors from checking vllm/engine (#9267) 2024-10-16 22:55:59 +00:00
__init__.py Change the name to vLLM (#150) 2023-06-17 03:07:40 -07:00
arg_utils.py [Doc] update gpu-memory-utilization flag docs (#9507) 2024-10-19 11:26:36 +08:00
async_llm_engine.py [Frontend] merge beam search implementations (#9296) 2024-10-14 15:05:52 -07:00
async_timeout.py [Bugfix] AsyncLLMEngine hangs with asyncio.run (#5654) 2024-06-19 13:57:12 -07:00
llm_engine.py [core] move parallel sampling out from vllm core (#9302) 2024-10-22 00:31:44 +00:00
metrics_types.py [MISC] Add lora requests to metrics (#9477) 2024-10-18 20:50:18 +00:00
metrics.py [MISC] Add lora requests to metrics (#9477) 2024-10-18 20:50:18 +00:00
protocol.py [BugFix] Typing fixes to RequestOutput.prompt and beam search (#9473) 2024-10-18 07:19:53 +00:00