vllm/benchmarks/kernels
2024-07-27 17:52:33 -04:00
..
benchmark_aqlm.py [Frontend] Add FlexibleArgumentParser to support both underscore and dash in names (#5718) 2024-06-20 17:00:13 -06:00
benchmark_marlin.py [Kernel] Increase precision of GPTQ/AWQ Marlin kernel (#6795) 2024-07-27 17:52:33 -04:00
benchmark_moe.py [Frontend] Add FlexibleArgumentParser to support both underscore and dash in names (#5718) 2024-06-20 17:00:13 -06:00
benchmark_paged_attention.py [Model] H2O Danube3-4b (#6451) 2024-07-26 20:47:50 -07:00
benchmark_rope.py [Model] H2O Danube3-4b (#6451) 2024-07-26 20:47:50 -07:00
benchmark_shapes.py Add marlin unit tests and marlin benchmark script (#4815) 2024-05-16 09:36:49 -04:00