vllm/vllm/worker
youkaichao cea808f325
[3/N] model runner pass the whole config to model (#9958)
Signed-off-by: youkaichao <youkaichao@gmail.com>
2024-11-02 12:08:49 -07:00
__init__.py Change the name to vLLM (#150) 2023-06-17 03:07:40 -07:00
cache_engine.py [Kernel] Support sliding window in flash attention backend (#9403) 2024-10-20 10:57:52 -07:00
cpu_enc_dec_model_runner.py [Hardware][CPU] Cross-attention and Encoder-Decoder models support on CPU backend (#9089) 2024-10-07 06:50:35 +00:00
cpu_model_runner.py [3/N] model runner pass the whole config to model (#9958) 2024-11-02 12:08:49 -07:00
cpu_worker.py [2/N] executor pass the complete config to worker/modelrunner (#9938) 2024-11-02 07:35:05 -07:00
embedding_model_runner.py [2/N] executor pass the complete config to worker/modelrunner (#9938) 2024-11-02 07:35:05 -07:00
enc_dec_model_runner.py [2/N] executor pass the complete config to worker/modelrunner (#9938) 2024-11-02 07:35:05 -07:00
model_runner_base.py [2/N] executor pass the complete config to worker/modelrunner (#9938) 2024-11-02 07:35:05 -07:00
model_runner.py [3/N] model runner pass the whole config to model (#9958) 2024-11-02 12:08:49 -07:00
multi_step_model_runner.py [2/N] executor pass the complete config to worker/modelrunner (#9938) 2024-11-02 07:35:05 -07:00
multi_step_tpu_worker.py [TPU] Implement multi-step scheduling (#8489) 2024-09-14 16:58:31 -07:00
multi_step_worker.py [2/N] executor pass the complete config to worker/modelrunner (#9938) 2024-11-02 07:35:05 -07:00
neuron_model_runner.py [2/N] executor pass the complete config to worker/modelrunner (#9938) 2024-11-02 07:35:05 -07:00
neuron_worker.py [2/N] executor pass the complete config to worker/modelrunner (#9938) 2024-11-02 07:35:05 -07:00
openvino_model_runner.py [2/N] executor pass the complete config to worker/modelrunner (#9938) 2024-11-02 07:35:05 -07:00
openvino_worker.py [2/N] executor pass the complete config to worker/modelrunner (#9938) 2024-11-02 07:35:05 -07:00
tpu_model_runner.py [3/N] model runner pass the whole config to model (#9958) 2024-11-02 12:08:49 -07:00
tpu_worker.py [2/N] executor pass the complete config to worker/modelrunner (#9938) 2024-11-02 07:35:05 -07:00
utils.py [Doc] Compatibility matrix for mutual exclusive features (#8512) 2024-10-11 11:18:50 -07:00
worker_base.py [2/N] executor pass the complete config to worker/modelrunner (#9938) 2024-11-02 07:35:05 -07:00
worker.py [2/N] executor pass the complete config to worker/modelrunner (#9938) 2024-11-02 07:35:05 -07:00
xpu_model_runner.py [3/N] model runner pass the whole config to model (#9958) 2024-11-02 12:08:49 -07:00
xpu_worker.py [2/N] executor pass the complete config to worker/modelrunner (#9938) 2024-11-02 07:35:05 -07:00