vllm/tests/core
2024-05-02 14:31:20 -07:00
..
block [Core] Enable prefix caching with block manager v2 enabled (#4142) 2024-05-01 11:20:32 -07:00
__init__.py [Tests] Add block manager and scheduler tests (#3108) 2024-03-05 18:23:34 -08:00
test_block_manager.py [Core] Ignore infeasible swap requests. (#4557) 2024-05-02 14:31:20 -07:00
test_chunked_prefill_scheduler.py [Core] Ignore infeasible swap requests. (#4557) 2024-05-02 14:31:20 -07:00
test_scheduler.py [Core] Ignore infeasible swap requests. (#4557) 2024-05-02 14:31:20 -07:00
utils.py [Speculative decoding 6/9] Integrate speculative decoding with LLMEngine (#3894) 2024-04-16 13:09:21 -07:00