vllm/models at 4ca2c358b178d5e026db925a1ed9f8945010a98f - vllm

History

Philipp Moritz 4ca2c358b1 Add documentation section about LoRA (#2834 )		2024-02-12 17:24:45 +01:00
..
adding_model.rst	Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221 )	2024-01-03 11:30:22 -08:00
engine_args.rst	[Docs] Update documentation for gpu-memory-utilization option (#2162 )	2023-12-17 10:51:57 -08:00
lora.rst	Add documentation section about LoRA (#2834 )	2024-02-12 17:24:45 +01:00
supported_models.rst	Add Internlm2 (#2666 )	2024-02-01 09:27:40 -08:00