vllm/csrc/punica
2024-03-27 00:37:42 +00:00
..
bgmv Enable more models to inference based on LoRA (#3382) 2024-03-25 18:09:31 -07:00
LICENSE [Experimental] Add multi-LoRA support (#1804) 2024-01-23 15:26:37 -08:00
punica_ops.cc [Kernel] support non-zero cuda devices in punica kernels (#3636) 2024-03-27 00:37:42 +00:00