vllm/worker at c42590f97a8fd7bcc22137777f031eeee6df8187 - vllm

Kunshang Ji c42590f97a [Hardware] [Intel GPU] refactor xpu worker/executor (#7686)	2024-08-20 09:54:10 -07:00
__init__.py	Change the name to vLLM (#150)	2023-06-17 03:07:40 -07:00
cache_engine.py	[Model] Jamba support (#4115)	2024-07-02 23:11:29 +00:00
cpu_model_runner.py	[VLM] Refactor `MultiModalConfig` initialization and profiling (#7530)	2024-08-17 13:30:55 -07:00
cpu_worker.py	[VLM] Refactor `MultiModalConfig` initialization and profiling (#7530)	2024-08-17 13:30:55 -07:00
embedding_model_runner.py	[VLM] Refactor `MultiModalConfig` initialization and profiling (#7530)	2024-08-17 13:30:55 -07:00
enc_dec_model_runner.py	[VLM] Refactor `MultiModalConfig` initialization and profiling (#7530)	2024-08-17 13:30:55 -07:00
model_runner_base.py	[core] Multi Step Scheduling (#7000)	2024-08-19 13:52:13 -07:00
model_runner.py	[VLM] Refactor `MultiModalConfig` initialization and profiling (#7530)	2024-08-17 13:30:55 -07:00
multi_step_model_runner.py	[core] Multi Step Scheduling (#7000)	2024-08-19 13:52:13 -07:00
multi_step_worker.py	[core] Multi Step Scheduling (#7000)	2024-08-19 13:52:13 -07:00
neuron_model_runner.py	[Bugfix] update neuron for version > 0.5.0 (#7175)	2024-08-15 09:44:14 -07:00
neuron_worker.py	[Bugfix] update neuron for version > 0.5.0 (#7175)	2024-08-15 09:44:14 -07:00
openvino_model_runner.py	[Bugfix] Fix broadcasting logic for `multi_modal_kwargs` (#6836)	2024-07-31 10:38:45 +08:00
openvino_worker.py	[core][distributed] support n layers % pp size != 0 (#6115)	2024-07-03 16:40:31 -07:00
tpu_model_runner.py	[TPU] Remove redundant input tensor cloning (#7660)	2024-08-19 15:55:04 -07:00
tpu_worker.py	[TPU] Skip creating empty tensor (#7630)	2024-08-17 14:22:46 -07:00
utils.py	[VLM] Refactor `MultiModalConfig` initialization and profiling (#7530)	2024-08-17 13:30:55 -07:00
worker_base.py	[core] Multi Step Scheduling (#7000)	2024-08-19 13:52:13 -07:00
worker.py	[Core] Optimize SPMD architecture with delta + serialization optimization (#7109)	2024-08-18 17:57:20 -07:00
xpu_model_runner.py	[Hardware] [Intel GPU] refactor xpu worker/executor (#7686)	2024-08-20 09:54:10 -07:00
xpu_worker.py	[Hardware] [Intel GPU] refactor xpu worker/executor (#7686)	2024-08-20 09:54:10 -07:00