vllm/.buildkite at 7041de43849fda7c8e931f0726f3db2a0d8015a4 - vllm

History

Lily Liu 7041de4384 [Kernel] Flashinfer for prefill & decode, with Cudagraph support for decode (#4628 ) Co-authored-by: LiuXiaoxuanPKU <llilyliupku@gmail.com>, bong-furiosa <bongwon.jang@furiosa.ai>		2024-06-28 15:28:49 -07:00
..
nightly-benchmarks	[ci] Add A100 queue into AWS CI template (#5648 )	2024-06-19 08:42:13 -06:00
check-wheel-size.py	[CI/Build] increase wheel size limit to 200 MB (#5130 )	2024-05-30 06:29:48 -07:00
download-images.sh	[Feature] Add vision language model support. (#3042 )	2024-03-25 14:16:30 -07:00
release-pipeline.yaml	[ci] Setup Release pipeline and build release wheels with cache (#5610 )	2024-06-18 11:00:36 -07:00
run-amd-test.sh	[CI/Build] Docker cleanup functionality for amd servers (#5112 )	2024-05-30 03:27:39 +00:00
run-benchmarks.sh	[ci] Fix Buildkite agent path (#5392 )	2024-06-10 18:58:07 -07:00
run-cpu-test.sh	[CI/Build][Misc] Update Pytest Marker for VLMs (#5623 )	2024-06-18 13:10:04 +00:00
run-neuron-test.sh	[CI] clean docker cache for neuron (#4441 )	2024-04-28 23:32:07 +00:00
run-openvino-test.sh	[Hardware][Intel] OpenVINO vLLM backend (#5379 )	2024-06-28 13:50:16 +00:00
run-xpu-test.sh	[Hardware][Intel GPU] Add Intel GPU(XPU) inference backend (#3814 )	2024-06-17 11:01:25 -07:00
test-pipeline.yaml	[Kernel] Flashinfer for prefill & decode, with Cudagraph support for decode (#4628 )	2024-06-28 15:28:49 -07:00