[Bugfix] Fix #7592 vllm 0.5.4 enable_chunked_prefill throughput is slightly lower than 0.5.3~0.5.0. (#7874)
| | | |
|---|---|---|
| __init__.py | ||
| test_basic_correctness.py | ||
| test_chunked_prefill.py | ||
| test_cpu_offload.py | ||
| test_preemption.py | ||