vllm/engine at 008cf886c9361e696f70a15a282d72b58686468a - vllm

Harsha vardhan manoj Bikki 008cf886c9 [Neuron] Adding support for adding/ overriding neuron configuration a… (#8062) Co-authored-by: Harsha Bikki <harbikh@amazon.com>	2024-09-04 16:33:43 -07:00
output_processor	[Core] Optimize Async + Multi-step (#8050)	2024-09-03 18:50:29 +00:00
__init__.py	Change the name to vLLM (#150)	2023-06-17 03:07:40 -07:00
arg_utils.py	[Neuron] Adding support for adding/ overriding neuron configuration a… (#8062)	2024-09-04 16:33:43 -07:00
async_llm_engine.py	[Core] Optimize Async + Multi-step (#8050)	2024-09-03 18:50:29 +00:00
async_timeout.py	[Bugfix] AsyncLLMEngine hangs with asyncio.run (#5654)	2024-06-19 13:57:12 -07:00
llm_engine.py	[Neuron] Adding support for adding/ overriding neuron configuration a… (#8062)	2024-09-04 16:33:43 -07:00
metrics_types.py	[MISC] Add prefix cache hit rate to metrics (#7606)	2024-08-19 11:52:07 -07:00
metrics.py	[MISC] Add prefix cache hit rate to metrics (#7606)	2024-08-19 11:52:07 -07:00
protocol.py	[Core] Logprobs support in Multi-step (#7652)	2024-08-29 19:19:08 -07:00