vllm/tests/models
Shawn Tan f8d60145b4
[Model] Add Granite model (#7436)
Co-authored-by: Nick Hill <nickhill@us.ibm.com>
2024-09-01 18:37:18 -07:00
..
__init__.py [CI/Build] Move test_utils.py to tests/utils.py (#4425) 2024-05-13 23:50:09 +09:00
test_aqlm.py [CI/Build][REDO] Add is_quant_method_supported to control quantization test configurations (#5466) 2024-06-13 15:18:08 +00:00
test_bart.py [Model] Add UltravoxModel and UltravoxConfig (#7615) 2024-08-21 22:49:39 +00:00
test_big_models.py [CI/Build] Reuse code for checking output consistency (#5988) 2024-06-30 11:44:25 +08:00
test_blip2.py [Model] Add UltravoxModel and UltravoxConfig (#7615) 2024-08-21 22:49:39 +00:00
test_chameleon.py [Model] Add UltravoxModel and UltravoxConfig (#7615) 2024-08-21 22:49:39 +00:00
test_danube3_4b.py [Model] H2O Danube3-4b (#6451) 2024-07-26 20:47:50 -07:00
test_embedding.py [CI/Test] improve robustness of test (vllm_runner) (#5357) 2024-06-08 08:59:20 +00:00
test_fp8.py [mypy] Enable type checking for test directory (#5017) 2024-06-15 04:45:31 +00:00
test_fp8kv_flashinfer.py [Bugfix] bugfix and add model test for flashinfer fp8 kv cache. (#8013) 2024-08-30 22:18:50 -07:00
test_fuyu.py [VLM][Core] Support profiling with multiple multi-modal inputs per prompt (#7126) 2024-08-14 17:55:42 +00:00
test_gguf.py [Core] Support tensor parallelism for GGUF quantization (#7520) 2024-08-19 17:30:14 -04:00
test_gptq_marlin_24.py [CI/Build][REDO] Add is_quant_method_supported to control quantization test configurations (#5466) 2024-06-13 15:18:08 +00:00
test_gptq_marlin.py add gptq_marlin test for bug report https://github.com/vllm-project/vllm/issues/5088 (#5145) 2024-06-15 13:38:16 -04:00
test_granite.py [Model] Add Granite model (#7436) 2024-09-01 18:37:18 -07:00
test_intern_vit.py [VLM][Model] TP support for ViTs (#7186) 2024-08-30 08:19:27 -07:00
test_internvl.py [VLM][Model] TP support for ViTs (#7186) 2024-08-30 08:19:27 -07:00
test_jamba.py [Kernel] W8A16 Int8 inside FusedMoE (#7415) 2024-08-16 10:06:51 -07:00
test_llava_image_embeds.py [Model] Add UltravoxModel and UltravoxConfig (#7615) 2024-08-21 22:49:39 +00:00
test_llava_next.py [Model] Add multi-image input support for LLaVA-Next offline inference (#7230) 2024-08-28 07:09:02 +08:00
test_llava.py [VLM] Disallow overflowing max_model_len for multimodal models (#7998) 2024-08-29 17:49:04 -07:00
test_marlin.py [CI/Build][REDO] Add is_quant_method_supported to control quantization test configurations (#5466) 2024-06-13 15:18:08 +00:00
test_minicpmv.py [CI/Build][VLM] Cleanup multiple images inputs model test (#7897) 2024-08-27 15:28:30 +00:00
test_mistral.py [Model] Add Mistral Tokenization to improve robustness and chat encoding (#7739) 2024-08-27 12:40:02 +00:00
test_models.py [CI/Build] Reuse code for checking output consistency (#5988) 2024-06-30 11:44:25 +08:00
test_oot_registration.py [misc][ci] fix cpu test with plugins (#7489) 2024-08-13 19:27:46 -07:00
test_paligemma.py [Model] Add UltravoxModel and UltravoxConfig (#7615) 2024-08-21 22:49:39 +00:00
test_phi3v.py [CI/Build][VLM] Cleanup multiple images inputs model test (#7897) 2024-08-27 15:28:30 +00:00
test_phimoe.py [Model] Adding support for MSFT Phi-3.5-MoE (#7729) 2024-08-30 13:42:57 -06:00
test_qwen.py [Model] Add UltravoxModel and UltravoxConfig (#7615) 2024-08-21 22:49:39 +00:00
test_registry.py [Model] Support SigLIP encoder and alternative decoders for LLaVA models (#7153) 2024-08-06 16:55:31 +08:00
test_ultravox.py [Misc] Optional installation of audio related packages (#8063) 2024-09-01 14:46:57 -07:00
utils.py [Core] Logprobs support in Multi-step (#7652) 2024-08-29 19:19:08 -07:00