vllm/tests/core
2024-10-10 14:17:17 +08:00
Name | Last commit | Date
block | [Core] Add an environment variable which needs to be set explicitly to allow BlockSpaceManagerV1 (#9149) | 2024-10-10 14:17:17 +08:00
__init__.py | [Tests] Add block manager and scheduler tests (#3108) | 2024-03-05 18:23:34 -08:00
test_block_manager.py | [Performance] Enable chunked prefill and prefix caching together (#7753) | 2024-08-28 00:36:31 -07:00
test_chunked_prefill_scheduler.py | [Core] Add an environment variable which needs to be set explicitly to allow BlockSpaceManagerV1 (#9149) | 2024-10-10 14:17:17 +08:00
test_num_computed_tokens_update.py | [Bugfix] Fix incorrect updates to num_computed_tokens in multi-step scheduling (#9038) | 2024-10-06 12:48:11 -07:00
test_scheduler_encoder_decoder.py | [Core] Subclass ModelRunner to support cross-attention & encoder sequences (towards eventual encoder/decoder model support) (#4942) | 2024-08-06 16:51:47 -04:00
test_scheduler.py | [Core] Add an environment variable which needs to be set explicitly to allow BlockSpaceManagerV1 (#9149) | 2024-10-10 14:17:17 +08:00
test_serialization.py | [Core] Optimize SPMD architecture with delta + serialization optimization (#7109) | 2024-08-18 17:57:20 -07:00
utils.py | [core] remove beam search from the core (#9105) | 2024-10-07 05:47:04 +00:00