vllm/vllm/worker
youkaichao 8438e0569e
[Core] RayWorkerVllm --> WorkerWrapper to reduce duplication (#4024)
[Core] replace narrow-usage RayWorkerVllm to general WorkerWrapper to reduce code duplication (#4024)
2024-04-17 08:34:33 +00:00
__init__.py Change the name to vLLM (#150) 2023-06-17 03:07:40 -07:00
cache_engine.py [Misc] [Core] Implement RFC "Augment BaseExecutor interfaces to enable hardware-agnostic speculative decoding" (#3837) 2024-04-09 11:44:15 -07:00
cpu_model_runner.py [Core] Refactor model loading code (#4097) 2024-04-16 11:34:39 -07:00
cpu_worker.py [Core] RayWorkerVllm --> WorkerWrapper to reduce duplication (#4024) 2024-04-17 08:34:33 +00:00
model_runner.py [Core] Refactor model loading code (#4097) 2024-04-16 11:34:39 -07:00
neuron_model_runner.py [Core] Refactor model loading code (#4097) 2024-04-16 11:34:39 -07:00
neuron_worker.py [Core] RayWorkerVllm --> WorkerWrapper to reduce duplication (#4024) 2024-04-17 08:34:33 +00:00
worker_base.py [Core] RayWorkerVllm --> WorkerWrapper to reduce duplication (#4024) 2024-04-17 08:34:33 +00:00
worker.py [Core] RayWorkerVllm --> WorkerWrapper to reduce duplication (#4024) 2024-04-17 08:34:33 +00:00