[Misc] Fix typo in scheduler.py (#4022)
parent 96b6a6d790
commit d4ec9ffb95
@@ -674,7 +674,7 @@ class Scheduler:
     def _schedule_default(self) -> SchedulerOutputs:
         """Schedule queued requests.
 
-        The current policy is designed to opimimize the throughput. First,
+        The current policy is designed to optimize the throughput. First,
         it batches as many prefill requests as possible. And it schedules
         decodes. If there's a pressure on GPU memory, decode requests can
         be swapped or preempted.
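
For readers unfamiliar with the policy the docstring describes, the following is a minimal sketch of a prefill-first scheduling step. It is not the code in scheduler.py: `Request`, `Plan`, `schedule_step`, and the block-counting memory model are all hypothetical names and simplifications introduced here for illustration. It assumes each waiting request needs a fixed number of memory blocks for its prefill, each running request needs one more block to decode, and preempting a request frees the blocks it holds.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Request:
    """Hypothetical request record (not a type from scheduler.py)."""
    name: str
    held: int = 0  # memory blocks the request currently owns
    need: int = 1  # additional blocks it needs to run this step


@dataclass
class Plan:
    """Hypothetical result of one scheduling step."""
    prefills: list = field(default_factory=list)
    decodes: list = field(default_factory=list)
    preempted: list = field(default_factory=list)


def schedule_step(waiting, running, free):
    """Plan one step: prefills first, then decodes, preempting under pressure.

    `waiting` and `running` are lists ordered oldest first; `free` is the
    number of free memory blocks. Neither input list is modified.
    """
    plan = Plan()

    # 1. Batch as many prefill requests as fit, in arrival order.
    for req in waiting:
        if req.need > free:
            break  # stop at the first request that does not fit
        free -= req.need
        plan.prefills.append(req.name)

    # 2. Schedule decodes, oldest first. When memory runs out, preempt
    #    the newest pending request to reclaim the blocks it holds.
    pending = deque(running)
    while pending:
        req = pending.popleft()
        while req.need > free and pending:
            victim = pending.pop()
            free += victim.held
            plan.preempted.append(victim.name)
        if req.need <= free:
            free -= req.need
            plan.decodes.append(req.name)
        else:
            # Nothing left to evict, so this request is preempted too.
            free += req.held
            plan.preempted.append(req.name)
    return plan


if __name__ == "__main__":
    waiting = [Request("p1", need=2), Request("p2", need=2), Request("p3", need=5)]
    running = [Request("d1", held=3), Request("d2", held=2), Request("d3", held=1)]
    print(schedule_step(waiting, running, free=5))
```

In this sketch the newest decode request is evicted first, on the assumption that it has the least work to redo. Whether a preempted request is swapped out or recomputed later, and whether prefills and decodes share a step, are left out of this simplification.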