vllm/vllm/lora/ops
2024-09-23 18:57:42 +00:00
__init__.py [Kernel][RFC] Refactor the punica kernel based on Triton (#5036) 2024-07-31 17:12:24 -07:00
bgmv_expand_slice.py [Kernel][LoRA] Add assertion for punica sgmv kernels (#7585) 2024-09-23 18:57:42 +00:00
bgmv_expand.py [Kernel][LoRA] Add assertion for punica sgmv kernels (#7585) 2024-09-23 18:57:42 +00:00
bgmv_shrink.py [Bugfix] Make torch registration of punica ops optional (#7970) 2024-08-28 16:11:49 -06:00
sgmv_expand_slice.py [Kernel][LoRA] Add assertion for punica sgmv kernels (#7585) 2024-09-23 18:57:42 +00:00
sgmv_expand.py [Kernel][LoRA] Add assertion for punica sgmv kernels (#7585) 2024-09-23 18:57:42 +00:00
sgmv_shrink.py [Kernel][LoRA] Add assertion for punica sgmv kernels (#7585) 2024-09-23 18:57:42 +00:00
utils.py [Kernel][RFC] Refactor the punica kernel based on Triton (#5036) 2024-07-31 17:12:24 -07:00