vllm/tests/models
Robert Shaw c0c2335ce0
Integrate Marlin Kernels for Int4 GPTQ inference (#2497)
Co-authored-by: Robert Shaw <114415538+rib-2@users.noreply.github.com>
Co-authored-by: alexm <alexm@neuralmagic.com>
2024-03-01 12:47:51 -08:00
test_marlin.py   Integrate Marlin Kernels for Int4 GPTQ inference (#2497)                   2024-03-01 12:47:51 -08:00
test_mistral.py  [BugFix] Fix input positions for long context with sliding window (#2088)  2023-12-13 12:28:13 -08:00
test_models.py   Support starcoder2 architecture (#3089)                                    2024-02-29 00:51:48 -08:00