vllm/vllm/lora

Latest commit: eefeb16464 — [Kernel] Full Tensor Parallelism for LoRA Layers (#3524)
Author: Austin Veselka
Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>
Date: 2024-04-27 00:03:48 -07:00

Files (last-modifying commit and date):

__init__.py          [Experimental] Add multi-LoRA support (#1804)                2024-01-23 15:26:37 -08:00
fully_sharded_layers.py  [Kernel] Full Tensor Parallelism for LoRA Layers (#3524)  2024-04-27 00:03:48 -07:00
layers.py            [Kernel] Full Tensor Parallelism for LoRA Layers (#3524)     2024-04-27 00:03:48 -07:00
lora.py              [Mypy] Typing lora folder (#4337)                            2024-04-25 19:13:50 +00:00
models.py            [Kernel] Full Tensor Parallelism for LoRA Layers (#3524)     2024-04-27 00:03:48 -07:00
punica.py            [Kernel] Full Tensor Parallelism for LoRA Layers (#3524)     2024-04-27 00:03:48 -07:00
request.py           [Experimental] Add multi-LoRA support (#1804)                2024-01-23 15:26:37 -08:00
utils.py             [Kernel] Full Tensor Parallelism for LoRA Layers (#3524)     2024-04-27 00:03:48 -07:00
worker_manager.py    [Mypy] Typing lora folder (#4337)                            2024-04-25 19:13:50 +00:00