vllm/models at ab9e8488d57687d1144e7b4802d195f35882ecdc - vllm

GoHomeToMacDonal 1a2bbc9301 ChatGLM Support (#1261)	2023-11-06 16:09:33 -08:00
__init__.py	ChatGLM Support (#1261)	2023-11-06 16:09:33 -08:00
aquila.py	Add rope_scaling to Aquila model (#1457)	2023-10-29 04:25:21 -07:00
baichuan.py	TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181)	2023-10-02 15:36:09 -07:00
bloom.py	TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181)	2023-10-02 15:36:09 -07:00
chatglm.py	ChatGLM Support (#1261)	2023-11-06 16:09:33 -08:00
falcon.py	TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181)	2023-10-02 15:36:09 -07:00
gpt2.py	TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181)	2023-10-02 15:36:09 -07:00
gpt_bigcode.py	TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181)	2023-10-02 15:36:09 -07:00
gpt_j.py	TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181)	2023-10-02 15:36:09 -07:00
gpt_neox.py	TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181)	2023-10-02 15:36:09 -07:00
internlm.py	Fix bias in InternLM (#1501)	2023-10-29 16:24:18 -07:00
llama.py	Support SqueezeLLM (#1326)	2023-10-21 23:14:59 -07:00
mistral.py	Support SqueezeLLM (#1326)	2023-10-21 23:14:59 -07:00
mpt.py	Remove `MPTConfig` (#1529)	2023-11-01 15:29:05 -07:00
opt.py	TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181)	2023-10-02 15:36:09 -07:00
qwen.py	TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181)	2023-10-02 15:36:09 -07:00
yi.py	Support Yi model (#1567)	2023-11-06 15:26:03 -08:00