vllm/models at 7a64d24aad69e4d2548aa0bf528d9fe63428ab01 - vllm

History

Cyrus Leung 7a64d24aad [Core] Support image processor (#4197 )		2024-06-02 22:56:41 -07:00
..
__init__.py	[CI/Build] Move `test_utils.py` to `tests/utils.py` (#4425 )	2024-05-13 23:50:09 +09:00
test_aqlm.py	AQLM CUDA support (#3287 )	2024-04-23 13:59:33 -04:00
test_big_models.py	[Kernel] Add flash-attn back (#4907 )	2024-05-19 18:11:30 -07:00
test_embedding.py	[Model][Misc] Add e5-mistral-7b-instruct and Embedding API (#3734 )	2024-05-11 11:30:37 -07:00
test_fp8.py	[Misc] Load FP8 kv-cache scaling factors from checkpoints (#4893 )	2024-05-22 13:28:20 -07:00
test_gptq_marlin_24.py	Add GPTQ Marlin 2:4 sparse structured support (#4790 )	2024-05-16 12:56:15 -04:00
test_gptq_marlin.py	[Kernel] add bfloat16 support for gptq marlin kernel (#4788 )	2024-05-16 09:55:29 -04:00
test_llava.py	[Core] Support image processor (#4197 )	2024-06-02 22:56:41 -07:00
test_marlin.py	[CI/Build] Move `test_utils.py` to `tests/utils.py` (#4425 )	2024-05-13 23:50:09 +09:00
test_mistral.py	[Bugfix] Fix Mistral v0.3 Weight Loading (#5005 )	2024-05-24 12:28:27 +00:00
test_models.py	[Misc]Add customized information for models (#4132 )	2024-04-30 21:18:14 -07:00
test_oot_registration.py	[Core] enable out-of-tree model register (#3871 )	2024-04-06 17:11:41 -07:00
test_registry.py	[Bugfix][Model] Add base class for vision-language models (#4809 )	2024-05-19 00:13:33 -07:00
utils.py	[Kernel] Marlin Expansion: Support AutoGPTQ Models with Marlin (#3922 )	2024-04-29 09:35:34 -07:00