[Bugfix] Fix #7592 vllm 0.5.4 enable_chunked_prefill throughput is slightly lower than 0.5.3~0.5.0. (#7874) |
||
|---|---|---|
| .. | ||
| block | ||
| __init__.py | ||
| block_manager_v1.py | ||
| block_manager_v2.py | ||
| embedding_model_block_manager.py | ||
| evictor_v1.py | ||
| evictor_v2.py | ||
| interfaces.py | ||
| scheduler.py | ||