vllm/docs/source/models
张大成 48a8f4a7fd
Support Orion model (#2539)
Co-authored-by: zhangdacheng <zhangdacheng@ainirobot.com>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
2024-02-26 19:17:06 -08:00
..
adding_model.rst Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221) 2024-01-03 11:30:22 -08:00
engine_args.rst [Docs] Update documentation for gpu-memory-utilization option (#2162) 2023-12-17 10:51:57 -08:00
lora.rst multi-LoRA as extra models in OpenAI server (#2775) 2024-02-17 12:00:48 -08:00
supported_models.rst Support Orion model (#2539) 2024-02-26 19:17:06 -08:00