vllm/tests/core
afeldman-nm fd95e026e0
[Core] Subclass ModelRunner to support cross-attention & encoder sequences (towards eventual encoder/decoder model support) (#4942)
Co-authored-by: Andrew Feldman <afeld2012@gmail.com>
Co-authored-by: Nick Hill <nickhill@us.ibm.com>
2024-08-06 16:51:47 -04:00
..
block [core][scheduler] simplify and improve scheduler (#6867) 2024-07-31 23:51:09 -07:00
__init__.py [Tests] Add block manager and scheduler tests (#3108) 2024-03-05 18:23:34 -08:00
test_block_manager.py [Core] Avoid the need to pass None values to Sequence.inputs (#5099) 2024-05-29 16:05:01 -07:00
test_chunked_prefill_scheduler.py [mypy] Enable type checking for test directory (#5017) 2024-06-15 04:45:31 +00:00
test_scheduler_encoder_decoder.py [Core] Subclass ModelRunner to support cross-attention & encoder sequences (towards eventual encoder/decoder model support) (#4942) 2024-08-06 16:51:47 -04:00
test_scheduler.py [Core] Subclass ModelRunner to support cross-attention & encoder sequences (towards eventual encoder/decoder model support) (#4942) 2024-08-06 16:51:47 -04:00
utils.py [Core] Subclass ModelRunner to support cross-attention & encoder sequences (towards eventual encoder/decoder model support) (#4942) 2024-08-06 16:51:47 -04:00