vllm/docs/source
Isotr0py fbf152d976
[Bugfix][Model] Refactor OLMo model to support new HF format in transformers 4.40.0 (#4324)
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
2024-04-25 09:35:56 -07:00
..
assets fix document error for value and v_vec illustration (#3421) 2024-03-15 16:06:09 -07:00
dev Fix autodoc directives (#4272) 2024-04-23 01:53:01 +00:00
getting_started Add example scripts to documentation (#4225) 2024-04-22 16:36:54 +00:00
models [Bugfix][Model] Refactor OLMo model to support new HF format in transformers 4.40.0 (#4324) 2024-04-25 09:35:56 -07:00
quantization Enable scaled FP8 (e4m3fn) KV cache on ROCm (AMD GPU) (#3290) 2024-04-03 14:15:55 -07:00
serving [Doc] Add note for docker user (#4340) 2024-04-24 21:09:44 +00:00
conf.py Add example scripts to documentation (#4225) 2024-04-22 16:36:54 +00:00
generate_examples.py Add example scripts to documentation (#4225) 2024-04-22 16:36:54 +00:00
index.rst Add example scripts to documentation (#4225) 2024-04-22 16:36:54 +00:00