vllm/vllm/engine
2024-10-11 11:08:11 +00:00
multiprocessing [Misc] Collect model support info in a single process per model (#9233) 2024-10-11 11:08:11 +00:00
output_processor [misc] hide best_of from engine (#9261) 2024-10-10 21:30:44 -07:00
__init__.py Change the name to vLLM (#150) 2023-06-17 03:07:40 -07:00
arg_utils.py [Misc] Collect model support info in a single process per model (#9233) 2024-10-11 11:08:11 +00:00
async_llm_engine.py [Frontend] API support for beam search for MQLLMEngine (#9117) 2024-10-08 05:51:43 +00:00
async_timeout.py [Bugfix] AsyncLLMEngine hangs with asyncio.run (#5654) 2024-06-19 13:57:12 -07:00
llm_engine.py [misc] hide best_of from engine (#9261) 2024-10-10 21:30:44 -07:00
metrics_types.py [misc] hide best_of from engine (#9261) 2024-10-10 21:30:44 -07:00
metrics.py [misc] hide best_of from engine (#9261) 2024-10-10 21:30:44 -07:00
protocol.py [Core] [Frontend] Priority scheduling for embeddings and in the OpenAI-API (#8965) 2024-10-01 09:58:06 +00:00