vllm/docs/source
Roger Wang 6206dcb29e
[Model] Add PaliGemma (#5189)
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
2024-07-07 09:25:50 +08:00
..
_templates/sections [misc][doc] try to add warning for latest html (#5979) 2024-07-04 09:57:09 -07:00
assets [Doc] add visualization for multi-stage dockerfile (#4456) 2024-04-30 17:41:59 +00:00
automatic_prefix_caching [Doc] Add an automatic prefix caching section in vllm documentation (#5324) 2024-06-11 10:24:59 -07:00
community [Docs] Add ZhenFund as a Sponsor (#5548) 2024-06-14 11:17:21 -07:00
dev [Doc] Move guide for multimodal model and other improvements (#6168) 2024-07-06 17:18:59 +08:00
getting_started [doc][misc] bump up py version in installation doc (#6119) 2024-07-03 15:52:04 -07:00
models [Model] Add PaliGemma (#5189) 2024-07-07 09:25:50 +08:00
quantization [Kernel] Expand FP8 support to Ampere GPUs using FP8 Marlin (#5975) 2024-07-03 17:38:00 +00:00
serving [Bugfix][Doc] Fix Doc Formatting (#6048) 2024-07-01 15:09:11 -07:00
conf.py [Docs] Fix readthedocs for tag build (#6158) 2024-07-05 12:44:40 -07:00
generate_examples.py Add example scripts to documentation (#4225) 2024-04-22 16:36:54 +00:00
index.rst [Doc] Move guide for multimodal model and other improvements (#6168) 2024-07-06 17:18:59 +08:00