vllm/docs/source
Isotr0py 7cbd9ec7a9
[Model] Initialize support for InternVL2 series models (#6514)
Co-authored-by: Roger Wang <ywang@roblox.com>
2024-07-29 10:16:30 +00:00
..
_static [Docs] Add RunLLM chat widget (#6857) 2024-07-27 09:24:46 -07:00
_templates/sections [Doc] Guide for adding multi-modal plugins (#6205) 2024-07-10 14:55:34 +08:00
assets [Doc] add visualization for multi-stage dockerfile (#4456) 2024-04-30 17:41:59 +00:00
automatic_prefix_caching [Doc] Add an automatic prefix caching section in vllm documentation (#5324) 2024-06-11 10:24:59 -07:00
community [Docs] Publish 5th meetup slides (#6799) 2024-07-25 16:47:55 -07:00
dev [Model] Adding support for MiniCPM-V (#4087) 2024-07-24 20:59:30 -07:00
getting_started [TPU] Reduce compilation time & Upgrade PyTorch XLA version (#6856) 2024-07-27 10:28:33 -07:00
models [Model] Initialize support for InternVL2 series models (#6514) 2024-07-29 10:16:30 +00:00
performance_benchmark [Doc] Add documentations for nightly benchmarks (#6412) 2024-07-25 11:57:16 -07:00
quantization [bitsandbytes]: support read bnb pre-quantized model (#5753) 2024-07-23 23:45:09 +00:00
serving [Doc] Update SkyPilot doc for wrong indents and instructions for update service (#4283) 2024-07-26 14:39:10 -07:00
conf.py [Docs] Add RunLLM chat widget (#6857) 2024-07-27 09:24:46 -07:00
generate_examples.py Add example scripts to documentation (#4225) 2024-04-22 16:36:54 +00:00
index.rst [Doc] Add documentations for nightly benchmarks (#6412) 2024-07-25 11:57:16 -07:00