vllm/source at 16620f439db1f2cc91b5582b59fc8845cbb02881 - vllm

History

Roger Wang 6206dcb29e [Model] Add PaliGemma (#5189 ) Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>		2024-07-07 09:25:50 +08:00
..
_templates/sections	[misc][doc] try to add warning for latest html (#5979 )	2024-07-04 09:57:09 -07:00
assets	[Doc] add visualization for multi-stage dockerfile (#4456 )	2024-04-30 17:41:59 +00:00
automatic_prefix_caching	[Doc] Add an automatic prefix caching section in vllm documentation (#5324 )	2024-06-11 10:24:59 -07:00
community	[Docs] Add ZhenFund as a Sponsor (#5548 )	2024-06-14 11:17:21 -07:00
dev	[Doc] Move guide for multimodal model and other improvements (#6168 )	2024-07-06 17:18:59 +08:00
getting_started	[doc][misc] bump up py version in installation doc (#6119 )	2024-07-03 15:52:04 -07:00
models	[Model] Add PaliGemma (#5189 )	2024-07-07 09:25:50 +08:00
quantization	[Kernel] Expand FP8 support to Ampere GPUs using FP8 Marlin (#5975 )	2024-07-03 17:38:00 +00:00
serving	[Bugfix][Doc] Fix Doc Formatting (#6048 )	2024-07-01 15:09:11 -07:00
conf.py	[Docs] Fix readthedocs for tag build (#6158 )	2024-07-05 12:44:40 -07:00
generate_examples.py	Add example scripts to documentation (#4225 )	2024-04-22 16:36:54 +00:00
index.rst	[Doc] Move guide for multimodal model and other improvements (#6168 )	2024-07-06 17:18:59 +08:00