vllm/tests
Antoni Baum 080438477f
Start background task in AsyncLLMEngine.generate (#988)
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
2023-09-08 00:03:39 -07:00
async_engine Start background task in AsyncLLMEngine.generate (#988) 2023-09-08 00:03:39 -07:00
kernels [FIX] Fix Alibi implementation in PagedAttention kernel (#945) 2023-09-07 15:53:14 -07:00
models Add tests for models (#922) 2023-09-01 11:19:43 +09:00
samplers Align vLLM's beam search implementation with HF generate (#857) 2023-09-04 17:29:42 -07:00
conftest.py Use queue for finished requests (#957) 2023-09-05 19:27:23 -07:00