"""vLLM: a high-throughput and memory-efficient inference engine for LLMs"""

from vllm.engine.arg_utils import AsyncEngineArgs, EngineArgs
from vllm.engine.async_llm_engine import AsyncLLMEngine
from vllm.engine.llm_engine import LLMEngine
from vllm.entrypoints.llm import LLM
from vllm.executor.ray_utils import initialize_ray_cluster
from vllm.inputs import PromptType, TextPrompt, TokensPrompt
from vllm.model_executor.models import ModelRegistry
from vllm.outputs import (CompletionOutput, EmbeddingOutput,
                          EmbeddingRequestOutput, RequestOutput)
from vllm.pooling_params import PoolingParams
from vllm.sampling_params import SamplingParams

from .version import __version__, __version_tuple__

# Explicit public API of the ``vllm`` package: version metadata first, then
# the user-facing entry points (LLM, prompt/output types), then the lower
# level engine classes and their argument containers.
__all__ = [
    "__version__",
    "__version_tuple__",
    "LLM",
    "ModelRegistry",
    "PromptType",
    "TextPrompt",
    "TokensPrompt",
    "SamplingParams",
    "RequestOutput",
    "CompletionOutput",
    "EmbeddingOutput",
    "EmbeddingRequestOutput",
    "LLMEngine",
    "EngineArgs",
    "AsyncLLMEngine",
    "AsyncEngineArgs",
    "initialize_ray_cluster",
    "PoolingParams",
]