vllm/benchmark
2023-04-08 23:36:12 -07:00
..
benchmark_attention.py Add query stride to multi_query_cached_kv_attention & Add kernel benchmark script (#27) 2023-04-08 13:36:09 -07:00
benchmark_latency.py Add an option to use dummy model weights (#33) 2023-04-08 23:36:12 -07:00