vllm/tests/worker
Cade Daniel 5757d90e26
[Speculative decoding] Adding configuration object for speculative decoding (#3706)
Co-authored-by: Lily Liu <lilyliupku@gmail.com>
2024-04-03 00:40:57 +00:00
__init__.py | [Speculative decoding 2/9] Multi-step worker for draft model (#2424) | 2024-01-21 16:31:47 -08:00
test_model_runner.py | [2/N] Chunked prefill data update (#3538) | 2024-03-28 10:06:01 -07:00
test_swap.py | [Speculative decoding] Adding configuration object for speculative decoding (#3706) | 2024-04-03 00:40:57 +00:00