vllm/lora at 10760da8003824e208c94fb2bfcdb6fdd0f4edda - vllm

Austin Veselka 10760da800 [Bugfix] Fixed error in slice_lora_b for MergedQKVParallelLinearWithLora (#4609)	2024-05-07 10:59:07 -07:00
__init__.py	[Experimental] Add multi-LoRA support (#1804)	2024-01-23 15:26:37 -08:00
fully_sharded_layers.py	[Bugfix] Fixed error in slice_lora_b for MergedQKVParallelLinearWithLora (#4609)	2024-05-07 10:59:07 -07:00
layers.py	[Bugfix] Fixed error in slice_lora_b for MergedQKVParallelLinearWithLora (#4609)	2024-05-07 10:59:07 -07:00
lora.py	[Mypy] Typing lora folder (#4337)	2024-04-25 19:13:50 +00:00
models.py	[Kernel] Full Tensor Parallelism for LoRA Layers (#3524)	2024-04-27 00:03:48 -07:00
punica.py	[Kernel] Full Tensor Parallelism for LoRA Layers (#3524)	2024-04-27 00:03:48 -07:00
request.py	[Experimental] Add multi-LoRA support (#1804)	2024-01-23 15:26:37 -08:00
utils.py	[Kernel] Full Tensor Parallelism for LoRA Layers (#3524)	2024-04-27 00:03:48 -07:00
worker_manager.py	[Mypy] Typing lora folder (#4337)	2024-04-25 19:13:50 +00:00