vllm/vllm/worker
Stephanie Wang dda4811591
[Core] Refactor Worker and ModelRunner to consolidate control plane communication (#5408)
Signed-off-by: Stephanie Wang <swang@cs.berkeley.edu>
Signed-off-by: Stephanie <swang@anyscale.com>
Co-authored-by: Stephanie <swang@anyscale.com>
2024-06-25 20:30:03 -07:00
..
__init__.py Change the name to vLLM (#150) 2023-06-17 03:07:40 -07:00
cache_engine.py [Hardware][Intel GPU] Add Intel GPU(XPU) inference backend (#3814) 2024-06-17 11:01:25 -07:00
cpu_model_runner.py [Core] Refactor Worker and ModelRunner to consolidate control plane communication (#5408) 2024-06-25 20:30:03 -07:00
cpu_worker.py [Core] Refactor Worker and ModelRunner to consolidate control plane communication (#5408) 2024-06-25 20:30:03 -07:00
embedding_model_runner.py [Core] Refactor Worker and ModelRunner to consolidate control plane communication (#5408) 2024-06-25 20:30:03 -07:00
model_runner_base.py [Core] Refactor Worker and ModelRunner to consolidate control plane communication (#5408) 2024-06-25 20:30:03 -07:00
model_runner.py [Core] Refactor Worker and ModelRunner to consolidate control plane communication (#5408) 2024-06-25 20:30:03 -07:00
neuron_model_runner.py [Core] Refactor Worker and ModelRunner to consolidate control plane communication (#5408) 2024-06-25 20:30:03 -07:00
neuron_worker.py [Core] Refactor Worker and ModelRunner to consolidate control plane communication (#5408) 2024-06-25 20:30:03 -07:00
tpu_model_runner.py [Hardware][TPU] Raise errors for unsupported sampling params (#5850) 2024-06-25 16:58:23 -07:00
tpu_worker.py [Hardware][TPU] Refactor TPU backend (#5831) 2024-06-25 15:25:52 -07:00
worker_base.py [Core] Refactor Worker and ModelRunner to consolidate control plane communication (#5408) 2024-06-25 20:30:03 -07:00
worker.py [Core] Refactor Worker and ModelRunner to consolidate control plane communication (#5408) 2024-06-25 20:30:03 -07:00
xpu_model_runner.py [Core] Refactor Worker and ModelRunner to consolidate control plane communication (#5408) 2024-06-25 20:30:03 -07:00
xpu_worker.py [Hardware][Intel GPU] Add Intel GPU(XPU) inference backend (#3814) 2024-06-17 11:01:25 -07:00