vllm/benchmarks/kernels
James Fleming 2b7949c1c2
AQLM CUDA support (#3287)
Co-authored-by: mgoin <michael@neuralmagic.com>
2024-04-23 13:59:33 -04:00
..
benchmark_aqlm.py AQLM CUDA support (#3287) 2024-04-23 13:59:33 -04:00
benchmark_mixtral_moe.py [CI] Try introducing isort. (#3495) 2024-03-25 07:59:47 -07:00
benchmark_paged_attention.py [Misc] Add indirection layer for custom ops (#3913) 2024-04-10 20:26:07 -07:00
benchmark_rope.py [CI] Try introducing isort. (#3495) 2024-03-25 07:59:47 -07:00