vllm/models at ab3a5a8259922ce312d01be39d29e27666968039 - vllm

History

Isotr0py ab3a5a8259 Support OLMo models. (#2832 )		2024-02-18 21:05:15 -08:00
..
adding_model.rst	Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221 )	2024-01-03 11:30:22 -08:00
engine_args.rst	[Docs] Update documentation for gpu-memory-utilization option (#2162 )	2023-12-17 10:51:57 -08:00
lora.rst	multi-LoRA as extra models in OpenAI server (#2775 )	2024-02-17 12:00:48 -08:00
supported_models.rst	Support OLMo models. (#2832 )	2024-02-18 21:05:15 -08:00