vllm/benchmarks/cutlass_benchmarks
Varun Sundar Rabindranath 35e9c12bfa
[Kernel] Tuned int8 Cutlass Kernels for SM75 (T4) (#6996)
Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
2024-07-31 14:40:32 -07:00
..
w8a8_benchmarks.py [Kernel] Tuned int8 Cutlass Kernels for SM75 (T4) (#6996) 2024-07-31 14:40:32 -07:00
weight_shapes.py Fix w8a8 benchmark and add Llama-3-8B (#5562) 2024-06-17 06:48:06 +00:00