vllm/tests
Woosuk Kwon e67b4f2c2a
Use FP32 in RoPE initialization (#1004)
Co-authored-by: One <imone@tuta.io>
2023-09-11 00:26:35 -07:00
..
async_engine Start background task in AsyncLLMEngine.generate (#988) 2023-09-08 00:03:39 -07:00
kernels Use FP32 in RoPE initialization (#1004) 2023-09-11 00:26:35 -07:00
models Add tests for models (#922) 2023-09-01 11:19:43 +09:00
samplers Align vLLM's beam search implementation with HF generate (#857) 2023-09-04 17:29:42 -07:00
conftest.py Use queue for finished requests (#957) 2023-09-05 19:27:23 -07:00