vllm/worker at 8438e0569eaf8496aa3d41deb808f2c831b64ecf - vllm

History

youkaichao 8438e0569e [Core] RayWorkerVllm --> WorkerWrapper to reduce duplication (#4024 ) [Core] replace narrow-usage RayWorkerVllm to general WorkerWrapper to reduce code duplication (#4024)		2024-04-17 08:34:33 +00:00
..
__init__.py	Change the name to vLLM (#150 )	2023-06-17 03:07:40 -07:00
cache_engine.py	[Misc] [Core] Implement RFC "Augment BaseExecutor interfaces to enable hardware-agnostic speculative decoding" (#3837 )	2024-04-09 11:44:15 -07:00
cpu_model_runner.py	[Core] Refactor model loading code (#4097 )	2024-04-16 11:34:39 -07:00
cpu_worker.py	[Core] RayWorkerVllm --> WorkerWrapper to reduce duplication (#4024 )	2024-04-17 08:34:33 +00:00
model_runner.py	[Core] Refactor model loading code (#4097 )	2024-04-16 11:34:39 -07:00
neuron_model_runner.py	[Core] Refactor model loading code (#4097 )	2024-04-16 11:34:39 -07:00
neuron_worker.py	[Core] RayWorkerVllm --> WorkerWrapper to reduce duplication (#4024 )	2024-04-17 08:34:33 +00:00
worker_base.py	[Core] RayWorkerVllm --> WorkerWrapper to reduce duplication (#4024 )	2024-04-17 08:34:33 +00:00
worker.py	[Core] RayWorkerVllm --> WorkerWrapper to reduce duplication (#4024 )	2024-04-17 08:34:33 +00:00