From 450664121277240b1c52611dbb7381ef2cc75355 Mon Sep 17 00:00:00 2001 From: Roger Wang <136131678+ywang96@users.noreply.github.com> Date: Tue, 20 Aug 2024 23:24:01 -0700 Subject: [PATCH] [Doc] Section for Multimodal Language Models (#7719) --- docs/source/models/supported_models.rst | 16 +++++++++++++--- 1 file changed, 13 insertions(+), 3 deletions(-) diff --git a/docs/source/models/supported_models.rst b/docs/source/models/supported_models.rst index 6fa81e88..c761d1b3 100644 --- a/docs/source/models/supported_models.rst +++ b/docs/source/models/supported_models.rst @@ -177,51 +177,61 @@ Decoder-only Language Models .. _supported_vlms: -Vision Language Models -^^^^^^^^^^^^^^^^^^^^^^^ +Multimodal Language Models +^^^^^^^^^^^^^^^^^^^^^^^^^^^^ .. list-table:: - :widths: 25 25 50 5 + :widths: 25 25 25 25 5 :header-rows: 1 * - Architecture - Models + - Supported Modality(ies) - Example HuggingFace Models - :ref:`LoRA ` * - :code:`Blip2ForConditionalGeneration` - BLIP-2 + - Image - :code:`Salesforce/blip2-opt-2.7b`, :code:`Salesforce/blip2-opt-6.7b`, etc. - * - :code:`ChameleonForConditionalGeneration` - Chameleon + - Image - :code:`facebook/chameleon-7b` etc. - * - :code:`FuyuForCausalLM` - Fuyu + - Image - :code:`adept/fuyu-8b` etc. - * - :code:`InternVLChatModel` - InternVL2 + - Image - :code:`OpenGVLab/InternVL2-4B`, :code:`OpenGVLab/InternVL2-8B`, etc. - * - :code:`LlavaForConditionalGeneration` - LLaVA-1.5 + - Image - :code:`llava-hf/llava-1.5-7b-hf`, :code:`llava-hf/llava-1.5-13b-hf`, etc. - * - :code:`LlavaNextForConditionalGeneration` - LLaVA-NeXT + - Image - :code:`llava-hf/llava-v1.6-mistral-7b-hf`, :code:`llava-hf/llava-v1.6-vicuna-7b-hf`, etc. - * - :code:`PaliGemmaForConditionalGeneration` - PaliGemma + - Image - :code:`google/paligemma-3b-pt-224`, :code:`google/paligemma-3b-mix-224`, etc. - * - :code:`Phi3VForCausalLM` - Phi-3-Vision + - Image - :code:`microsoft/Phi-3-vision-128k-instruct`, etc. - * - :code:`MiniCPMV` - MiniCPM-V + - Image - :code:`openbmb/MiniCPM-V-2` (see note), :code:`openbmb/MiniCPM-Llama3-V-2_5`, :code:`openbmb/MiniCPM-V-2_6`, etc. -