vllm/entrypoints at 1a2aef3e59f5429299618bd3b242833cb377f554 - vllm

Alexander Matveev 1a2aef3e59 Add output streaming support to multi-step + async while ensuring RequestOutput obj reuse (#8335)	2024-09-23 15:38:04 -07:00
llm	[Core] Support load and unload LoRA in api server (#6566)	2024-09-05 18:10:33 -07:00
offline_mode	[Bugfix] Offline mode fix (#8376)	2024-09-12 11:11:57 -07:00
openai	Add output streaming support to multi-step + async while ensuring RequestOutput obj reuse (#8335)	2024-09-23 15:38:04 -07:00
__init__.py	[CI/Build] Move `test_utils.py` to `tests/utils.py` (#4425)	2024-05-13 23:50:09 +09:00
conftest.py	Support for guided decoding for offline LLM (#6878)	2024-08-04 03:12:09 +00:00
test_chat_utils.py	[Frontend] Multimodal support in offline chat (#8098)	2024-09-04 05:22:17 +00:00