vllm/.buildkite
Lily Liu 69ec3ca14c
[Kernel][Model] logits_soft_cap for Gemma2 with flashinfer (#6051)
Co-authored-by: Simon Mo <simon.mo@hey.com>
2024-07-04 16:35:51 -07:00
Name | Last commit | Date
lm-eval-harness | [ Misc ] Refactor MoE to isolate Fp8 From Mixtral (#5970) | 2024-07-02 21:54:35 +00:00
nightly-benchmarks | [ci] Add A100 queue into AWS CI template (#5648) | 2024-06-19 08:42:13 -06:00
check-wheel-size.py | [CI/Build] increase wheel size limit to 200 MB (#5130) | 2024-05-30 06:29:48 -07:00
download-images.sh | [VLM] Remove image_input_type from VLM config (#5852) | 2024-07-02 07:57:09 +00:00
release-pipeline.yaml | [ci] Setup Release pipeline and build release wheels with cache (#5610) | 2024-06-18 11:00:36 -07:00
run-amd-test.sh | [CI/Build] Docker cleanup functionality for amd servers (#5112) | 2024-05-30 03:27:39 +00:00
run-benchmarks.sh | [ci] Fix Buildkite agent path (#5392) | 2024-06-10 18:58:07 -07:00
run-cpu-test.sh | [Hardware][Intel CPU] Adding intel openmp tunings in Docker file (#6008) | 2024-07-04 15:22:12 -07:00
run-neuron-test.sh | [CI] clean docker cache for neuron (#4441) | 2024-04-28 23:32:07 +00:00
run-openvino-test.sh | [Hardware][Intel] OpenVINO vLLM backend (#5379) | 2024-06-28 13:50:16 +00:00
run-xpu-test.sh | [Hardware][Intel GPU] Add Intel GPU(XPU) inference backend (#3814) | 2024-06-17 11:01:25 -07:00
test-pipeline.yaml | [Kernel][Model] logits_soft_cap for Gemma2 with flashinfer (#6051) | 2024-07-04 16:35:51 -07:00