vllm/lora at 96b6f475dda40a0c7d557f73c36fe09c07be2e9c - vllm

Kunshang Ji 96b6f475dd Remove hardcoded `device="cuda"` to support more devices (#2503) Co-authored-by: Jiang Li <jiang1.li@intel.com> Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>		2024-02-01 15:46:39 -08:00
__init__.py	[Experimental] Add multi-LoRA support (#1804)	2024-01-23 15:26:37 -08:00
conftest.py	Remove hardcoded `device="cuda"` to support more devices (#2503)	2024-02-01 15:46:39 -08:00
test_layers.py	Remove hardcoded `device="cuda"` to support more devices (#2503)	2024-02-01 15:46:39 -08:00
test_llama.py	[Experimental] Add multi-LoRA support (#1804)	2024-01-23 15:26:37 -08:00
test_lora_manager.py	[Experimental] Add multi-LoRA support (#1804)	2024-01-23 15:26:37 -08:00
test_lora.py	[Experimental] Add multi-LoRA support (#1804)	2024-01-23 15:26:37 -08:00
test_punica.py	[Experimental] Add multi-LoRA support (#1804)	2024-01-23 15:26:37 -08:00
test_tokenizer.py	[Experimental] Add multi-LoRA support (#1804)	2024-01-23 15:26:37 -08:00
test_utils.py	[Experimental] Add multi-LoRA support (#1804)	2024-01-23 15:26:37 -08:00
test_worker.py	Remove hardcoded `device="cuda"` to support more devices (#2503)	2024-02-01 15:46:39 -08:00
utils.py	[Experimental] Add multi-LoRA support (#1804)	2024-01-23 15:26:37 -08:00