vllm/executor at b5672a112c0e6c550f2dfa75c6cbe940a3f933bc - vllm

History

Nick Hill b5672a112c [Core] Multiprocessing Pipeline Parallel support (#6130 ) Co-authored-by: Murali Andoorveedu <muralidhar.andoorveedu@centml.ai>		2024-07-18 19:15:52 -07:00
..
__init__.py	Add distributed model executor abstraction (#3191 )	2024-03-11 11:03:45 -07:00
cpu_executor.py	[CORE] Adding support for insertion of soft-tuned prompts (#4645 )	2024-07-09 13:26:36 -07:00
distributed_gpu_executor.py	[Core] Introduce SPMD worker execution using Ray accelerated DAG (#6032 )	2024-07-17 22:27:09 -07:00
executor_base.py	[Core] Multiprocessing Pipeline Parallel support (#6130 )	2024-07-18 19:15:52 -07:00
gpu_executor.py	[Core] Multiprocessing Pipeline Parallel support (#6130 )	2024-07-18 19:15:52 -07:00
multiproc_gpu_executor.py	[Core] Multiprocessing Pipeline Parallel support (#6130 )	2024-07-18 19:15:52 -07:00
multiproc_worker_utils.py	[Core][Doc] Default to multiprocessing for single-node distributed case (#5230 )	2024-06-11 11:10:41 -07:00
neuron_executor.py	[Bugfix][Neuron] Fix soft prompt method error in NeuronExecutor (#6313 )	2024-07-10 16:39:02 -07:00
openvino_executor.py	[Bugfix] Fix OpenVINOExecutor abstractmethod error (#6296 )	2024-07-10 10:03:32 -07:00
ray_gpu_executor.py	[Core] Multiprocessing Pipeline Parallel support (#6130 )	2024-07-18 19:15:52 -07:00
ray_utils.py	[Core] Introduce SPMD worker execution using Ray accelerated DAG (#6032 )	2024-07-17 22:27:09 -07:00
ray_xpu_executor.py	[Core] Introduce SPMD worker execution using Ray accelerated DAG (#6032 )	2024-07-17 22:27:09 -07:00
tpu_executor.py	[Bugfix][TPU] Add prompt adapter methods to TPUExecutor (#6279 )	2024-07-09 19:30:56 -07:00
xpu_executor.py	[CORE] Adding support for insertion of soft-tuned prompts (#4645 )	2024-07-09 13:26:36 -07:00