vllm/.buildkite/lm-eval-harness/configs
2024-07-14 13:37:19 +00:00
..
DeepSeek-V2-Lite-Chat.yaml [ Misc ] Apply MoE Refactor to Deepseekv2 To Support Fp8 (#6417) 2024-07-13 20:03:58 -07:00
Meta-Llama-3-8B-Instruct-FP8-compressed-tensors.yaml [Kernel] Turn off CUTLASS scaled_mm for Ada Lovelace (#6384) 2024-07-14 13:37:19 +00:00
Meta-Llama-3-8B-Instruct-FP8.yaml [Kernel] Turn off CUTLASS scaled_mm for Ada Lovelace (#6384) 2024-07-14 13:37:19 +00:00
Meta-Llama-3-8B-Instruct-INT8-compressed-tensors.yaml [ Misc ] Support Fp8 via llm-compressor (#6110) 2024-07-07 20:42:11 +00:00
Meta-Llama-3-8B-Instruct.yaml [ CI/Build ] LM Eval Harness Based CI Testing (#5838) 2024-06-29 13:04:30 -04:00
Meta-Llama-3-70B-Instruct.yaml [ CI/Build ] LM Eval Harness Based CI Testing (#5838) 2024-06-29 13:04:30 -04:00
Mixtral-8x7B-Instruct-v0.1-FP8.yaml [ Misc ] Refactor MoE to isolate Fp8 From Mixtral (#5970) 2024-07-02 21:54:35 +00:00
Mixtral-8x7B-Instruct-v0.1.yaml [ CI/Build ] LM Eval Harness Based CI Testing (#5838) 2024-06-29 13:04:30 -04:00
Mixtral-8x22B-Instruct-v0.1-FP8-Dynamic.yaml [ Misc ] Refactor MoE to isolate Fp8 From Mixtral (#5970) 2024-07-02 21:54:35 +00:00
models-large.txt [ Misc ] Apply MoE Refactor to Deepseekv2 To Support Fp8 (#6417) 2024-07-13 20:03:58 -07:00
models-small.txt [ Misc ] Support Models With Bias in compressed-tensors integration (#6356) 2024-07-12 11:11:29 -04:00
Qwen2-1.5B-Instruct-INT8-compressed-tensors.yaml [ Misc ] Support Models With Bias in compressed-tensors integration (#6356) 2024-07-12 11:11:29 -04:00
Qwen2-1.5B-Instruct-W8A16-compressed-tensors.yaml [ Misc ] Support Models With Bias in compressed-tensors integration (#6356) 2024-07-12 11:11:29 -04:00
Qwen2-57B-A14-Instruct.yaml [ Misc ] Refactor MoE to isolate Fp8 From Mixtral (#5970) 2024-07-02 21:54:35 +00:00