vllm/engine at 929b4f2973ec6a53ea4f0f03d21147ef8b8278be - vllm

History

Liangfu Chen 3b7178cfa4 [Neuron] Support inference with transformers-neuronx (#2569 )		2024-02-28 09:34:34 -08:00
..
__init__.py	Change the name to vLLM (#150 )	2023-06-17 03:07:40 -07:00
arg_utils.py	[Neuron] Support inference with transformers-neuronx (#2569 )	2024-02-28 09:34:34 -08:00
async_llm_engine.py	fix some bugs (#2689 )	2024-01-31 10:09:23 -08:00
llm_engine.py	[Neuron] Support inference with transformers-neuronx (#2569 )	2024-02-28 09:34:34 -08:00
metrics.py	Port metrics from `aioprometheus` to `prometheus_client` (#2730 )	2024-02-25 11:54:00 -08:00
ray_utils.py	[Ray] Integration compiled DAG off by default (#2471 )	2024-02-08 09:57:25 -08:00