vllm/csrc/moe
Dipika Sikka fc911880cc
[Kernel] Expand MoE weight loading + Add Fused Marlin MoE Kernel (#7766)
Co-authored-by: ElizaWszola <eliza@neuralmagic.com>
2024-08-27 15:07:09 -07:00
..
marlin_moe_ops.cu [Kernel] Expand MoE weight loading + Add Fused Marlin MoE Kernel (#7766) 2024-08-27 15:07:09 -07:00
marlin_moe_ops.h [Kernel] Expand MoE weight loading + Add Fused Marlin MoE Kernel (#7766) 2024-08-27 15:07:09 -07:00
moe_ops.h [Kernel][Misc] Use TORCH_LIBRARY instead of PYBIND11_MODULE for custom ops (#5047) 2024-06-09 16:23:30 -04:00
topk_softmax_kernels.cu [Kernel][Misc] Use TORCH_LIBRARY instead of PYBIND11_MODULE for custom ops (#5047) 2024-06-09 16:23:30 -04:00
torch_bindings.cpp [Kernel] Expand MoE weight loading + Add Fused Marlin MoE Kernel (#7766) 2024-08-27 15:07:09 -07:00