vllm/benchmarks/cutlass_benchmarks
Luka Govedič 8d59dbb000
[Kernel] Add per-tensor and per-token AZP epilogues (#5941)
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>
2024-08-06 18:17:08 +00:00
..
w8a8_benchmarks.py [Kernel] Add per-tensor and per-token AZP epilogues (#5941) 2024-08-06 18:17:08 +00:00
weight_shapes.py Fix w8a8 benchmark and add Llama-3-8B (#5562) 2024-06-17 06:48:06 +00:00