vllm/docs/source/models
2024-03-21 09:45:24 +00:00
adding_model.rst      Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221)  2024-01-03 11:30:22 -08:00
engine_args.rst       Add Automatic Prefix Caching (#2762)  2024-03-02 00:50:01 -08:00
lora.rst              [Doc] Add docs about OpenAI compatible server (#3288)  2024-03-18 22:05:34 -07:00
supported_models.rst  [🚀 Ready to be merged] Added support for Jais models (#3183)  2024-03-21 09:45:24 +00:00