vllm/vllm/engine
2024-02-04 14:32:42 -08:00
..
__init__.py Change the name to vLLM (#150) 2023-06-17 03:07:40 -07:00
arg_utils.py Remove hardcoded device="cuda" to support more devices (#2503) 2024-02-01 15:46:39 -08:00
async_llm_engine.py fix some bugs (#2689) 2024-01-31 10:09:23 -08:00
llm_engine.py Remove eos tokens from output by default (#2611) 2024-02-04 14:32:42 -08:00
metrics.py Refactor Prometheus and Add Request Level Metrics (#2316) 2024-01-31 14:58:07 -08:00
ray_utils.py [Minor] Fix warning on Ray dependencies (#2630) 2024-01-27 15:43:40 -08:00