22 lines
396 B
ReStructuredText
22 lines
396 B
ReStructuredText
Welcome to vLLM!
|
|
================
|
|
|
|
vLLM is a high-throughput and memory-efficient inference and serving engine for large language models (LLM).
|
|
|
|
Documentation
|
|
-------------
|
|
|
|
.. toctree::
|
|
:maxdepth: 1
|
|
:caption: Getting Started
|
|
|
|
getting_started/installation
|
|
getting_started/quickstart
|
|
|
|
.. toctree::
|
|
:maxdepth: 1
|
|
:caption: Models
|
|
|
|
models/supported_models
|
|
models/adding_model
|