vllm/vllm/worker
2024-01-08 10:11:06 -08:00
__init__.py      Change the name to vLLM (#150)  2023-06-17 03:07:40 -07:00
cache_engine.py  [Build] Avoid building too many extensions (#1624)  2023-11-23 16:31:19 -08:00
model_runner.py  Fix eager mode performance (#2377)  2024-01-08 10:11:06 -08:00
worker.py        Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221)  2024-01-03 11:30:22 -08:00