vllm/docs/source
youkaichao d03d64fd2e
[CI/Build] refactor dockerfile & fix pip cache
[CI/Build] fix pip cache with vllm_nccl & refactor dockerfile to build wheels (#3859)
2024-04-04 21:53:16 -07:00
..
assets fix document error for value and v_vec illustration (#3421) 2024-03-15 16:06:09 -07:00
dev [Doc] Add docs about OpenAI compatible server (#3288) 2024-03-18 22:05:34 -07:00
getting_started [Hardware][Intel] Add CPU inference backend (#3634) 2024-04-01 22:07:30 -07:00
models [Doc]Add asynchronous engine arguments to documentation. (#3810) 2024-04-04 21:52:01 -07:00
quantization Enable scaled FP8 (e4m3fn) KV cache on ROCm (AMD GPU) (#3290) 2024-04-03 14:15:55 -07:00
serving Usage Stats Collection (#2852) 2024-03-28 22:16:12 -07:00
conf.py [CI/Build] refactor dockerfile & fix pip cache 2024-04-04 21:53:16 -07:00
index.rst Enable scaled FP8 (e4m3fn) KV cache on ROCm (AMD GPU) (#3290) 2024-04-03 14:15:55 -07:00