vllm/vllm/engine
2024-02-28 09:34:34 -08:00
..
__init__.py Change the name to vLLM (#150) 2023-06-17 03:07:40 -07:00
arg_utils.py [Neuron] Support inference with transformers-neuronx (#2569) 2024-02-28 09:34:34 -08:00
async_llm_engine.py fix some bugs (#2689) 2024-01-31 10:09:23 -08:00
llm_engine.py [Neuron] Support inference with transformers-neuronx (#2569) 2024-02-28 09:34:34 -08:00
metrics.py Port metrics from aioprometheus to prometheus_client (#2730) 2024-02-25 11:54:00 -08:00
ray_utils.py [Ray] Integration compiled DAG off by default (#2471) 2024-02-08 09:57:25 -08:00