vllm/engine at 15310b5101963818b76f1821e93887cb22f0aea6 - vllm

History

Michael Goin 15310b5101 [Bugfix] Use LoadFormat values for `vllm serve --load-format` (#7784 )		2024-08-22 11:37:08 -07:00
..
output_processor	[mypy] Enable following imports for entrypoints (#7248 )	2024-08-20 23:28:21 -07:00
__init__.py	Change the name to vLLM (#150 )	2023-06-17 03:07:40 -07:00
arg_utils.py	[Bugfix] Use LoadFormat values for `vllm serve --load-format` (#7784 )	2024-08-22 11:37:08 -07:00
async_llm_engine.py	[misc] Add Torch profiler support (#7451 )	2024-08-21 15:39:26 -07:00
async_timeout.py	[Bugfix] AsyncLLMEngine hangs with asyncio.run (#5654 )	2024-06-19 13:57:12 -07:00
llm_engine.py	[multi-step] Raise error if not using async engine (#7703 )	2024-08-21 11:49:19 -07:00
metrics_types.py	[MISC] Add prefix cache hit rate to metrics (#7606 )	2024-08-19 11:52:07 -07:00
metrics.py	[MISC] Add prefix cache hit rate to metrics (#7606 )	2024-08-19 11:52:07 -07:00
protocol.py	[misc] Add Torch profiler support (#7451 )	2024-08-21 15:39:26 -07:00