vllm/vllm/engine (last updated 2024-08-29 11:18:26 -07:00)
| Name                | Last commit                                                                    | Date                       |
|---------------------|--------------------------------------------------------------------------------|----------------------------|
| output_processor    | [Core] Asynchronous Output Processor (#7049)                                   | 2024-08-26 20:53:20 -07:00 |
| __init__.py         | Change the name to vLLM (#150)                                                 | 2023-06-17 03:07:40 -07:00 |
| arg_utils.py        | [Model] Add Mistral Tokenization to improve robustness and chat encoding (#7739) | 2024-08-27 12:40:02 +00:00 |
| async_llm_engine.py | [Core] Combine async postprocessor and multi-step (#7921)                      | 2024-08-29 11:18:26 -07:00 |
| async_timeout.py    | [Bugfix] AsyncLLMEngine hangs with asyncio.run (#5654)                         | 2024-06-19 13:57:12 -07:00 |
| llm_engine.py       | [Core] Combine async postprocessor and multi-step (#7921)                      | 2024-08-29 11:18:26 -07:00 |
| metrics_types.py    | [MISC] Add prefix cache hit rate to metrics (#7606)                            | 2024-08-19 11:52:07 -07:00 |
| metrics.py          | [MISC] Add prefix cache hit rate to metrics (#7606)                            | 2024-08-19 11:52:07 -07:00 |
| protocol.py         | [misc] Add Torch profiler support (#7451)                                      | 2024-08-21 15:39:26 -07:00 |