vllm/weight_loading at 8678a69ab51956031e3bb70bdf1a781a8652e67d - vllm

History

Dipika Sikka 8678a69ab5 [Kernel] Expand MoE weight loading + Add Fused Marlin MoE Kernel (#7527 ) Co-authored-by: ElizaWszola <eliza@neuralmagic.com>		2024-08-21 16:17:10 -07:00
..
models.txt	[Kernel] Expand MoE weight loading + Add Fused Marlin MoE Kernel (#7527 )	2024-08-21 16:17:10 -07:00
run_model_weight_loading_test.sh	[Misc] Update `gptq_marlin` to use new vLLMParameters (#7281 )	2024-08-13 14:30:11 -04:00
test_weight_loading.py	[Misc] Update `gptq_marlin` to use new vLLMParameters (#7281 )	2024-08-13 14:30:11 -04:00