vllm/vllm/engine
2024-09-28 08:52:46 -07:00
..
multiprocessing [Core] renamePromptInputs and inputs (#8876) 2024-09-26 20:35:15 -07:00
output_processor [Bugfix] Fix PP for Multi-Step (#8887) 2024-09-28 08:52:46 -07:00
__init__.py Change the name to vLLM (#150) 2023-06-17 03:07:40 -07:00
arg_utils.py [Core] Multi-Step + Single Step Prefills via Chunked Prefill code path (#8378) 2024-09-27 13:32:07 -07:00
async_llm_engine.py [Core] Priority-based scheduling in async engine (#8850) 2024-09-27 15:07:10 -07:00
async_timeout.py [Bugfix] AsyncLLMEngine hangs with asyncio.run (#5654) 2024-06-19 13:57:12 -07:00
llm_engine.py [Core] Priority-based scheduling in async engine (#8850) 2024-09-27 15:07:10 -07:00
metrics_types.py [MISC] Add prefix cache hit rate to metrics (#7606) 2024-08-19 11:52:07 -07:00
metrics.py [MISC] Add prefix cache hit rate to metrics (#7606) 2024-08-19 11:52:07 -07:00
protocol.py [Core] renamePromptInputs and inputs (#8876) 2024-09-26 20:35:15 -07:00