vllm/tests/core
Ricky Xu 4634a89d18
Prefix Cache Aware Scheduling [1/n] (#10128)
Signed-off-by: rickyx <rickyx@anyscale.com>
2024-11-22 21:15:55 -08:00
| Name | Last commit | Date |
| --- | --- | --- |
| block | Prefix Cache Aware Scheduling [1/n] (#10128) | 2024-11-22 21:15:55 -08:00 |
| __init__.py | [Tests] Add block manager and scheduler tests (#3108) | 2024-03-05 18:23:34 -08:00 |
| test_chunked_prefill_scheduler.py | [core] simplify seq group code (#9569) | 2024-10-24 00:16:44 -07:00 |
| test_num_computed_tokens_update.py | [Core] Deprecating block manager v1 and make block manager v2 default (#8704) | 2024-10-17 11:38:15 -05:00 |
| test_scheduler_encoder_decoder.py | [Model] Add user-configurable task for models that support both generation and embedding (#9424) | 2024-10-18 11:31:58 -07:00 |
| test_scheduler.py | Prefix Cache Aware Scheduling [1/n] (#10128) | 2024-11-22 21:15:55 -08:00 |
| test_serialization.py | [Core] Optimize SPMD architecture with delta + serialization optimization (#7109) | 2024-08-18 17:57:20 -07:00 |
| utils.py | Prefix Cache Aware Scheduling [1/n] (#10128) | 2024-11-22 21:15:55 -08:00 |