vllm/docs/source
Sanger Steel d619ae2d19
[Doc] Add better clarity for tensorizer usage (#4090)
Co-authored-by: Roger Wang <136131678+ywang96@users.noreply.github.com>
2024-04-15 13:28:25 -07:00
Name             Last commit message                                                 Last updated
assets           fix document error for value and v_vec illustration (#3421)         2024-03-15 16:06:09 -07:00
dev              [Doc] Add docs about OpenAI compatible server (#3288)               2024-03-18 22:05:34 -07:00
getting_started  [Doc][Installation] delete python setup.py develop (#3989)          2024-04-11 03:33:02 +00:00
models           [Doc] Add better clarity for tensorizer usage (#4090)               2024-04-15 13:28:25 -07:00
quantization     Enable scaled FP8 (e4m3fn) KV cache on ROCm (AMD GPU) (#3290)       2024-04-03 14:15:55 -07:00
serving          [Doc] Fix getting stared to use publicly available model (#3963)    2024-04-10 18:05:52 +00:00
conf.py          [Frontend] [Core] feat: Add model loading using tensorizer (#3476)  2024-04-13 17:13:01 -07:00
index.rst        Enable scaled FP8 (e4m3fn) KV cache on ROCm (AMD GPU) (#3290)       2024-04-03 14:15:55 -07:00