vllm/vllm/model_executor/models
2023-11-06 16:09:33 -08:00
..
__init__.py ChatGLM Support (#1261) 2023-11-06 16:09:33 -08:00
aquila.py Add rope_scaling to Aquila model (#1457) 2023-10-29 04:25:21 -07:00
baichuan.py TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181) 2023-10-02 15:36:09 -07:00
bloom.py TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181) 2023-10-02 15:36:09 -07:00
chatglm.py ChatGLM Support (#1261) 2023-11-06 16:09:33 -08:00
falcon.py TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181) 2023-10-02 15:36:09 -07:00
gpt2.py TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181) 2023-10-02 15:36:09 -07:00
gpt_bigcode.py TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181) 2023-10-02 15:36:09 -07:00
gpt_j.py TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181) 2023-10-02 15:36:09 -07:00
gpt_neox.py TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181) 2023-10-02 15:36:09 -07:00
internlm.py Fix bias in InternLM (#1501) 2023-10-29 16:24:18 -07:00
llama.py Support SqueezeLLM (#1326) 2023-10-21 23:14:59 -07:00
mistral.py Support SqueezeLLM (#1326) 2023-10-21 23:14:59 -07:00
mpt.py Remove MPTConfig (#1529) 2023-11-01 15:29:05 -07:00
opt.py TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181) 2023-10-02 15:36:09 -07:00
qwen.py TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181) 2023-10-02 15:36:09 -07:00
yi.py Support Yi model (#1567) 2023-11-06 15:26:03 -08:00