vllm/engine at 36ea79079bc499cd8fb07d3fe82fe069564e5570 - vllm

History

Cyrus Leung e808156f30 [Misc] Collect model support info in a single process per model (#9233 )		2024-10-11 11:08:11 +00:00
..
multiprocessing	[Misc] Collect model support info in a single process per model (#9233 )	2024-10-11 11:08:11 +00:00
output_processor	[misc] hide best_of from engine (#9261 )	2024-10-10 21:30:44 -07:00
__init__.py	Change the name to vLLM (#150 )	2023-06-17 03:07:40 -07:00
arg_utils.py	[Misc] Collect model support info in a single process per model (#9233 )	2024-10-11 11:08:11 +00:00
async_llm_engine.py	[Frontend] API support for beam search for MQLLMEngine (#9117 )	2024-10-08 05:51:43 +00:00
async_timeout.py	[Bugfix] AsyncLLMEngine hangs with asyncio.run (#5654 )	2024-06-19 13:57:12 -07:00
llm_engine.py	[misc] hide best_of from engine (#9261 )	2024-10-10 21:30:44 -07:00
metrics_types.py	[misc] hide best_of from engine (#9261 )	2024-10-10 21:30:44 -07:00
metrics.py	[misc] hide best_of from engine (#9261 )	2024-10-10 21:30:44 -07:00
protocol.py	[Core] [Frontend] Priority scheduling for embeddings and in the OpenAI-API (#8965 )	2024-10-01 09:58:06 +00:00