vllm/.buildkite
2024-05-16 10:02:56 -07:00
..
check-wheel-size.py [Kernel] Refactor FP8 kv-cache with NVIDIA float8_e4m3 support (#4535) 2024-05-09 18:04:17 -06:00
download-images.sh [Feature] Add vision language model support. (#3042) 2024-03-25 14:16:30 -07:00
run-amd-test.sh [Build/CI] Fixing 'docker run' to re-enable AMD CI tests. (#4642) 2024-05-07 09:23:17 -07:00
run-benchmarks.sh Add JSON output support for benchmark_latency and benchmark_throughput (#4848) 2024-05-16 10:02:56 -07:00
run-cpu-test.sh [HotFix] [CI/Build] Minor fix for CPU backend CI (#3787) 2024-04-01 22:50:53 -07:00
run-neuron-test.sh [CI] clean docker cache for neuron (#4441) 2024-04-28 23:32:07 +00:00
test-pipeline.yaml [Speculative decoding][Re-take] Enable TP>1 speculative decoding (#4840) 2024-05-16 00:53:51 -07:00
test-template.j2 [Build/CI] Fixing 'docker run' to re-enable AMD CI tests. (#4642) 2024-05-07 09:23:17 -07:00