vllm/tests/models
Robert Shaw 73c8d677e5
[Kernel] Marlin Expansion: Support AutoGPTQ Models with Marlin (#3922)
Co-authored-by: alexm <alexm@neuralmagic.com>
Co-authored-by: mgoin <michael@neuralmagic.com>
2024-04-29 09:35:34 -07:00
test_aqlm.py AQLM CUDA support (#3287) 2024-04-23 13:59:33 -04:00
test_big_models.py [Test] Make model tests run again and remove --forked from pytest (#3631) 2024-03-28 21:06:40 -07:00
test_gptq_marlin.py [Kernel] Marlin Expansion: Support AutoGPTQ Models with Marlin (#3922) 2024-04-29 09:35:34 -07:00
test_llava.py [Test] Make model tests run again and remove --forked from pytest (#3631) 2024-03-28 21:06:40 -07:00
test_marlin.py [Kernel] Marlin Expansion: Support AutoGPTQ Models with Marlin (#3922) 2024-04-29 09:35:34 -07:00
test_mistral.py [Test] Make model tests run again and remove --forked from pytest (#3631) 2024-03-28 21:06:40 -07:00
test_models.py [Core][5/N] Fully working chunked prefill e2e (#3884) 2024-04-10 17:56:48 -07:00
test_oot_registration.py [Core] enable out-of-tree model register (#3871) 2024-04-06 17:11:41 -07:00
utils.py [Kernel] Marlin Expansion: Support AutoGPTQ Models with Marlin (#3922) 2024-04-29 09:35:34 -07:00