vllm/.buildkite at 5d5b4c5fe524c3b62453bba7ad4434a27c81317a - vllm

Robert Shaw abfe705a02 [ Misc ] Support Fp8 via `llm-compressor` (#6110) Co-authored-by: Robert Shaw <rshaw@neuralmagic>	2024-07-07 20:42:11 +00:00
lm-eval-harness	[ Misc ] Support Fp8 via `llm-compressor` (#6110)	2024-07-07 20:42:11 +00:00
nightly-benchmarks	[ci] Add A100 queue into AWS CI template (#5648)	2024-06-19 08:42:13 -06:00
check-wheel-size.py	[CI/Build] increase wheel size limit to 200 MB (#5130)	2024-05-30 06:29:48 -07:00
download-images.sh	[VLM] Remove `image_input_type` from VLM config (#5852)	2024-07-02 07:57:09 +00:00
release-pipeline.yaml	Move release wheel env var to Dockerfile instead (#6163)	2024-07-05 17:19:53 -07:00
run-amd-test.sh	[CI/Build] Docker cleanup functionality for amd servers (#5112)	2024-05-30 03:27:39 +00:00
run-benchmarks.sh	[ci] Fix Buildkite agent path (#5392)	2024-06-10 18:58:07 -07:00
run-cpu-test.sh	[Hardware][Intel CPU] Adding intel openmp tunings in Docker file (#6008)	2024-07-04 15:22:12 -07:00
run-neuron-test.sh	[CI] clean docker cache for neuron (#4441)	2024-04-28 23:32:07 +00:00
run-openvino-test.sh	[Hardware][Intel] OpenVINO vLLM backend (#5379)	2024-06-28 13:50:16 +00:00
run-xpu-test.sh	[Hardware][Intel GPU] Add Intel GPU(XPU) inference backend (#3814)	2024-06-17 11:01:25 -07:00
test-pipeline.yaml	[Kernel][Model] logits_soft_cap for Gemma2 with flashinfer (#6051)	2024-07-04 16:35:51 -07:00