vllm/models at 93b38bea5dd03e1b140ca997dfaadef86f8f1855 - vllm

History

Junyang Lin 2832e7b9f9 fix names and license for Qwen2 (#2589 )		2024-01-24 22:37:51 -08:00
..
adding_model.rst	Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221 )	2024-01-03 11:30:22 -08:00
engine_args.rst	[Docs] Update documentation for gpu-memory-utilization option (#2162 )	2023-12-17 10:51:57 -08:00
supported_models.rst	fix names and license for Qwen2 (#2589 )	2024-01-24 22:37:51 -08:00