vllm/.buildkite
Lily Liu 7041de4384
[Kernel] Flashinfer for prefill & decode, with Cudagraph support for decode (#4628)
Co-authored-by: LiuXiaoxuanPKU <llilyliupku@gmail.com>, bong-furiosa <bongwon.jang@furiosa.ai>
2024-06-28 15:28:49 -07:00
nightly-benchmarks [ci] Add A100 queue into AWS CI template (#5648) 2024-06-19 08:42:13 -06:00
check-wheel-size.py [CI/Build] increase wheel size limit to 200 MB (#5130) 2024-05-30 06:29:48 -07:00
download-images.sh [Feature] Add vision language model support. (#3042) 2024-03-25 14:16:30 -07:00
release-pipeline.yaml [ci] Setup Release pipeline and build release wheels with cache (#5610) 2024-06-18 11:00:36 -07:00
run-amd-test.sh [CI/Build] Docker cleanup functionality for amd servers (#5112) 2024-05-30 03:27:39 +00:00
run-benchmarks.sh [ci] Fix Buildkite agent path (#5392) 2024-06-10 18:58:07 -07:00
run-cpu-test.sh [CI/Build][Misc] Update Pytest Marker for VLMs (#5623) 2024-06-18 13:10:04 +00:00
run-neuron-test.sh [CI] clean docker cache for neuron (#4441) 2024-04-28 23:32:07 +00:00
run-openvino-test.sh [Hardware][Intel] OpenVINO vLLM backend (#5379) 2024-06-28 13:50:16 +00:00
run-xpu-test.sh [Hardware][Intel GPU] Add Intel GPU(XPU) inference backend (#3814) 2024-06-17 11:01:25 -07:00
test-pipeline.yaml [Kernel] Flashinfer for prefill & decode, with Cudagraph support for decode (#4628) 2024-06-28 15:28:49 -07:00