vllm/benchmark
2023-05-11 15:45:30 -07:00
..
benchmark_attention.py Add query stride to multi_query_cached_kv_attention & Add kernel benchmark script (#27) 2023-04-08 13:36:09 -07:00
benchmark_cache.py Memcpy kernel for flash attention (#29) 2023-04-10 18:22:49 -07:00
benchmark_latency.py Enhance SamplingParams (#96) 2023-05-11 15:45:30 -07:00
benchmark_text_completion.py New weight loader without np copy (#52) 2023-05-03 15:32:04 +08:00
trace.py Collect system stats in scheduler & Add scripts for experiments (#30) 2023-04-12 15:03:49 -07:00