vllm/tests
wbn dacaf5a400
Replace head_mapping params with num_kv_heads to attention kernel. (#1997)
Co-authored-by: wangguoya <wangguoya@baidu.com>
Co-authored-by: Yang Zhao <zhaoyangstar@foxmail.com>
2023-12-10 10:12:53 -08:00
..
async_engine Support chat template and echo for chat API (#1756) 2023-11-30 16:43:13 -08:00
distributed [Fix] Fix comm test (#1691) 2023-11-16 16:28:39 -08:00
engine Migrate linter from pylint to ruff (#1665) 2023-11-20 11:58:01 -08:00
kernels Replace head_mapping params with num_kv_heads to attention kernel. (#1997) 2023-12-10 10:12:53 -08:00
models Support Microsoft Phi 1.5 (#1664) 2023-11-16 14:28:39 -08:00
samplers Fix broken sampler tests (#1896) 2023-12-02 16:06:17 -08:00
worker Fix broken worker test (#1900) 2023-12-02 22:17:33 -08:00
__init__.py [Small] Formatter only checks lints in changed files (#1528) 2023-10-31 15:39:38 -07:00
conftest.py Migrate linter from pylint to ruff (#1665) 2023-11-20 11:58:01 -08:00
test_regression.py [Minor] Fix duplication of ignored seq group in engine step (#1666) 2023-11-16 13:11:41 -08:00