vllm/docs/source/models
Simon Mo 51d4094fda
chunked-prefill-doc-syntax (#4603)
Fix the docs: https://docs.vllm.ai/en/latest/models/performance.html

Co-authored-by: sang <rkooo567@gmail.com>
2024-05-10 14:13:23 +09:00
..
adding_model.rst [Doc]: Update the doc of adding new models (#4236) 2024-04-21 09:57:08 -07:00
engine_args.rst Don't show default value for flags in EngineArgs (#4223) 2024-04-21 09:15:28 -07:00
lora.rst [Doc] Add docs about OpenAI compatible server (#3288) 2024-03-18 22:05:34 -07:00
performance.rst chunked-prefill-doc-syntax (#4603) 2024-05-10 14:13:23 +09:00
supported_models.rst [Bugfix][Model] Refactor OLMo model to support new HF format in transformers 4.40.0 (#4324) 2024-04-25 09:35:56 -07:00