vllm/benchmarks/kernels
Philipp Moritz cfc15a1031
Optimize Triton MoE Kernel (#2979)
Co-authored-by: Cade Daniel <edacih@gmail.com>
2024-02-26 13:48:56 -08:00
..
benchmark_mixtral_moe.py Optimize Triton MoE Kernel (#2979) 2024-02-26 13:48:56 -08:00
benchmark_paged_attention.py Remove hardcoded device="cuda" to support more devices (#2503) 2024-02-01 15:46:39 -08:00