[Docs] [Spec decode] Fix docs error in code example (#5427)
parent 9fde251bf0
commit 89ec06c33b
@@ -17,6 +17,7 @@ Speculating with a draft model
 The following code configures vLLM to use speculative decoding with a draft model, speculating 5 tokens at a time.
 
 .. code-block:: python
+
     from vllm import LLM, SamplingParams
 
     prompts = [
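To make "speculating 5 tokens at a time" concrete, here is a minimal, library-free sketch of one speculate-then-verify step. It is not vLLM's implementation: it assumes greedy decoding with exact-match acceptance, and `speculative_step`, `toy_target`, and `toy_draft` are hypothetical names invented for illustration.

```python
from typing import Callable, List

# Hypothetical stand-in for a model: maps a token sequence to its greedy next token.
NextToken = Callable[[List[int]], int]


def speculative_step(target: NextToken, draft: NextToken,
                     tokens: List[int], k: int = 5) -> List[int]:
    """Run one speculate-then-verify step and return the newly accepted tokens."""
    # 1. The cheap draft model proposes k tokens autoregressively.
    proposal: List[int] = []
    for _ in range(k):
        proposal.append(draft(tokens + proposal))

    # 2. The target model checks each proposed position in order. A real engine
    #    would score all positions in one batched pass; the loop is for clarity.
    accepted: List[int] = []
    for tok in proposal:
        expected = target(tokens + accepted)
        if tok != expected:
            # First mismatch: keep the target's own token and stop.
            accepted.append(expected)
            return accepted
        accepted.append(tok)

    # 3. Every proposal matched, so the target contributes one extra token.
    accepted.append(target(tokens + accepted))
    return accepted


def toy_target(seq: List[int]) -> int:
    # Toy "large" model: counts upward modulo 10.
    return (seq[-1] + 1) % 10


def toy_draft(seq: List[int]) -> int:
    # Toy "small" model: agrees with the target except when the answer is 7.
    nxt = (seq[-1] + 1) % 10
    return 0 if nxt == 7 else nxt
```

Under these assumptions the accepted tokens always equal what the target model alone would have produced; the draft model only changes how many tokens are obtained per verification step.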
@@ -45,6 +46,7 @@ The following code configures vLLM to use speculative decoding where proposals a
 matching n-grams in the prompt. For more information read `this thread. <https://x.com/joao_gante/status/1747322413006643259>`_
 
 .. code-block:: python
+
     from vllm import LLM, SamplingParams
 
     prompts = [
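The idea of generating proposals by "matching n-grams in the prompt" can be sketched without any model at all. The function below is an illustrative assumption, not vLLM's code: `ngram_propose` and its parameters `max_ngram` and `num_speculative_tokens` are hypothetical names, and the search order (longest n-gram first, most recent match first) is one plausible choice among several.

```python
from typing import List


def ngram_propose(tokens: List[int], max_ngram: int = 4,
                  num_speculative_tokens: int = 5) -> List[int]:
    """Propose continuation tokens by finding an earlier copy of the trailing n-gram."""
    # Try the longest n-gram first, then fall back to shorter ones.
    for n in range(min(max_ngram, len(tokens) - 1), 0, -1):
        suffix = tokens[-n:]
        # Scan earlier start positions, most recent first, skipping the suffix itself.
        for start in range(len(tokens) - n - 1, -1, -1):
            if tokens[start:start + n] == suffix:
                # Propose the tokens that followed the earlier occurrence.
                return tokens[start + n:start + n + num_speculative_tokens]
    # No earlier occurrence found: nothing to propose for this step.
    return []
```

For example, with the token sequence `[1, 2, 3, 4, 5, 1, 2, 3]` the trailing 3-gram `[1, 2, 3]` also appears at the start, so the tokens that followed it there become the proposal. The target model would then verify these proposals in the same way as draft-model proposals.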