vllm/tests
Zhuohan Li 947b794146
[Sampler] Vectorized sampling (simplified) (#1048)
Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>
2023-09-22 17:48:04 -07:00
async_engine  Remove AsyncLLMEngine busy loop, shield background task (#1059)  2023-09-17 00:29:08 -07:00
engine        Fix detokenization leaving special tokens (#1044)                2023-09-14 16:37:03 -07:00
kernels       Use FP32 in RoPE initialization (#1004)                          2023-09-11 00:26:35 -07:00
models        Add tests for models (#922)                                      2023-09-01 11:19:43 +09:00
samplers      [Sampler] Vectorized sampling (simplified) (#1048)               2023-09-22 17:48:04 -07:00
conftest.py   Use queue for finished requests (#957)                           2023-09-05 19:27:23 -07:00