vllm/.buildkite
Robert Shaw abfe705a02
[ Misc ] Support Fp8 via llm-compressor (#6110)
Co-authored-by: Robert Shaw <rshaw@neuralmagic>
2024-07-07 20:42:11 +00:00
..
lm-eval-harness [ Misc ] Support Fp8 via llm-compressor (#6110) 2024-07-07 20:42:11 +00:00
nightly-benchmarks [ci] Add A100 queue into AWS CI template (#5648) 2024-06-19 08:42:13 -06:00
check-wheel-size.py [CI/Build] increase wheel size limit to 200 MB (#5130) 2024-05-30 06:29:48 -07:00
download-images.sh [VLM] Remove image_input_type from VLM config (#5852) 2024-07-02 07:57:09 +00:00
release-pipeline.yaml Move release wheel env var to Dockerfile instead (#6163) 2024-07-05 17:19:53 -07:00
run-amd-test.sh [CI/Build] Docker cleanup functionality for amd servers (#5112) 2024-05-30 03:27:39 +00:00
run-benchmarks.sh [ci] Fix Buildkite agent path (#5392) 2024-06-10 18:58:07 -07:00
run-cpu-test.sh [Hardware][Intel CPU] Adding intel openmp tunings in Docker file (#6008) 2024-07-04 15:22:12 -07:00
run-neuron-test.sh [CI] clean docker cache for neuron (#4441) 2024-04-28 23:32:07 +00:00
run-openvino-test.sh [Hardware][Intel] OpenVINO vLLM backend (#5379) 2024-06-28 13:50:16 +00:00
run-xpu-test.sh [Hardware][Intel GPU] Add Intel GPU(XPU) inference backend (#3814) 2024-06-17 11:01:25 -07:00
test-pipeline.yaml [Kernel][Model] logits_soft_cap for Gemma2 with flashinfer (#6051) 2024-07-04 16:35:51 -07:00