vllm/executor at db3bf7c991cd1a0297d1a8ba501e59cfa226c337 - vllm

Woosuk Kwon e2b2aa5a0f [TPU] Align worker index with node boundary (#7932)	2024-09-01 23:09:46 -07:00
__init__.py	Add distributed model executor abstraction (#3191)	2024-03-11 11:03:45 -07:00
cpu_executor.py	[Core] Logprobs support in Multi-step (#7652)	2024-08-29 19:19:08 -07:00
distributed_gpu_executor.py	[Core] Logprobs support in Multi-step (#7652)	2024-08-29 19:19:08 -07:00
executor_base.py	[Core] Logprobs support in Multi-step (#7652)	2024-08-29 19:19:08 -07:00
gpu_executor.py	[Core] Logprobs support in Multi-step (#7652)	2024-08-29 19:19:08 -07:00
msgspec_utils.py	[Core] Optimize SPMD architecture with delta + serialization optimization (#7109)	2024-08-18 17:57:20 -07:00
multiproc_gpu_executor.py	[Core] Logprobs support in Multi-step (#7652)	2024-08-29 19:19:08 -07:00
multiproc_worker_utils.py	[Core][Doc] Default to multiprocessing for single-node distributed case (#5230)	2024-06-11 11:10:41 -07:00
multiproc_xpu_executor.py	[Hardware][Intel GPU] Add intel GPU pipeline parallel support. (#7810)	2024-08-27 10:07:02 -07:00
neuron_executor.py	[Core] Logprobs support in Multi-step (#7652)	2024-08-29 19:19:08 -07:00
openvino_executor.py	[Core] Logprobs support in Multi-step (#7652)	2024-08-29 19:19:08 -07:00
ray_gpu_executor.py	[Core] Logprobs support in Multi-step (#7652)	2024-08-29 19:19:08 -07:00
ray_tpu_executor.py	[TPU] Align worker index with node boundary (#7932)	2024-09-01 23:09:46 -07:00
ray_utils.py	[TPU] Support single and multi-host TPUs on GKE (#7613)	2024-08-30 00:27:40 -07:00
ray_xpu_executor.py	[Hardware][Intel GPU] refactor xpu_model_runner for tp (#7712)	2024-08-22 20:06:54 -07:00
tpu_executor.py	[Core] Logprobs support in Multi-step (#7652)	2024-08-29 19:19:08 -07:00
xpu_executor.py	[Core] Logprobs support in Multi-step (#7652)	2024-08-29 19:19:08 -07:00