vllm/benchmarks/cutlass_benchmarks
Varun Sundar Rabindranath 766435e660
[Kernel] Tuned FP8 Kernels for Ada Lovelace (#6677)
Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
2024-07-29 09:42:35 -06:00
..
w8a8_benchmarks.py [Kernel] Tuned FP8 Kernels for Ada Lovelace (#6677) 2024-07-29 09:42:35 -06:00
weight_shapes.py Fix w8a8 benchmark and add Llama-3-8B (#5562) 2024-06-17 06:48:06 +00:00