vllm/models at c6202daeedb22cd675942c37ae5e194549803c89 - vllm

History

Alex Brooks c6202daeed [Model] Support multiple images for qwen-vl (#8247 ) Signed-off-by: Alex-Brooks <Alex.Brooks@ibm.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>		2024-09-12 10:10:54 -07:00
..
__init__.py	[CI/Build] Move `test_utils.py` to `tests/utils.py` (#4425 )	2024-05-13 23:50:09 +09:00
test_aqlm.py	[Kernel][Misc] register ops to prevent graph breaks (#6917 )	2024-09-11 12:52:19 -07:00
test_bart.py	[Model] Add UltravoxModel and UltravoxConfig (#7615 )	2024-08-21 22:49:39 +00:00
test_big_models.py	[CI/Build] Reuse code for checking output consistency (#5988 )	2024-06-30 11:44:25 +08:00
test_blip2.py	[Model] Add UltravoxModel and UltravoxConfig (#7615 )	2024-08-21 22:49:39 +00:00
test_chameleon.py	[Model] Add UltravoxModel and UltravoxConfig (#7615 )	2024-08-21 22:49:39 +00:00
test_danube3_4b.py	[Model] H2O Danube3-4b (#6451 )	2024-07-26 20:47:50 -07:00
test_embedding.py	[CI/Test] improve robustness of test (vllm_runner) (#5357 )	2024-06-08 08:59:20 +00:00
test_fp8.py	[MISC] Consolidate FP8 kv-cache tests (#8131 )	2024-09-04 18:53:25 +00:00
test_fuyu.py	[VLM][Core] Support profiling with multiple multi-modal inputs per prompt (#7126 )	2024-08-14 17:55:42 +00:00
test_gguf.py	[Core] Support tensor parallelism for GGUF quantization (#7520 )	2024-08-19 17:30:14 -04:00
test_gptq_marlin_24.py	[CI/Build][REDO] Add is_quant_method_supported to control quantization test configurations (#5466 )	2024-06-13 15:18:08 +00:00
test_gptq_marlin.py	add gptq_marlin test for bug report https://github.com/vllm-project/vllm/issues/5088 (#5145 )	2024-06-15 13:38:16 -04:00
test_granite.py	[Model] Add Granite model (#7436 )	2024-09-01 18:37:18 -07:00
test_intern_vit.py	[VLM][Model] TP support for ViTs (#7186 )	2024-08-30 08:19:27 -07:00
test_internvl.py	[Bugfix] Fix InternVL2 inference with various num_patches (#8375 )	2024-09-12 10:10:35 -07:00
test_jamba.py	[Kernel] W8A16 Int8 inside FusedMoE (#7415 )	2024-08-16 10:06:51 -07:00
test_llava_image_embeds.py	[Model] Add UltravoxModel and UltravoxConfig (#7615 )	2024-08-21 22:49:39 +00:00
test_llava_next_video.py	[model] Support for Llava-Next-Video model (#7559 )	2024-09-10 22:21:36 -07:00
test_llava_next.py	[Model] Add multi-image input support for LLaVA-Next offline inference (#7230 )	2024-08-28 07:09:02 +08:00
test_llava.py	[Model] Multi-input support for LLaVA (#8238 )	2024-09-07 02:57:24 +00:00
test_marlin.py	[CI/Build][REDO] Add is_quant_method_supported to control quantization test configurations (#5466 )	2024-06-13 15:18:08 +00:00
test_minicpmv.py	[CI/Build][VLM] Cleanup multiple images inputs model test (#7897 )	2024-08-27 15:28:30 +00:00
test_mistral.py	[Model] Allow loading from original Mistral format (#8168 )	2024-09-06 17:02:05 -06:00
test_modelopt.py	[Hardware][NV] Add support for ModelOpt static scaling checkpoints. (#6112 )	2024-09-11 00:38:40 -04:00
test_models.py	[CI/Build] Reuse code for checking output consistency (#5988 )	2024-06-30 11:44:25 +08:00
test_oot_registration.py	[misc][ci] fix cpu test with plugins (#7489 )	2024-08-13 19:27:46 -07:00
test_paligemma.py	[Model] Add UltravoxModel and UltravoxConfig (#7615 )	2024-08-21 22:49:39 +00:00
test_phi3v.py	[Model][VLM] Support multi-images inputs for InternVL2 models (#8201 )	2024-09-07 16:38:23 +08:00
test_phimoe.py	[Model] Adding support for MSFT Phi-3.5-MoE (#7729 )	2024-08-30 13:42:57 -06:00
test_pixtral.py	Pixtral (#8377 )	2024-09-11 14:41:55 -07:00
test_qwen.py	[Model] Support multiple images for qwen-vl (#8247 )	2024-09-12 10:10:54 -07:00
test_registry.py	[Model][VLM] Add Qwen2-VL model support (#7905 )	2024-09-11 09:31:19 -07:00
test_ultravox.py	[Model] Add Ultravox support for multiple audio chunks (#7963 )	2024-09-04 04:38:21 +00:00
utils.py	[Core] Logprobs support in Multi-step (#7652 )	2024-08-29 19:19:08 -07:00