This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
1
Packages
Projects
Releases
Wiki
Activity
1a2aef3e59
vllm
/
tests
/
entrypoints
History
Alexander Matveev
1a2aef3e59
Add output streaming support to multi-step + async while ensuring RequestOutput obj reuse (
#8335
)
2024-09-23 15:38:04 -07:00
..
llm
[Core] Support load and unload LoRA in api server (
#6566
)
2024-09-05 18:10:33 -07:00
offline_mode
[Bugfix] Offline mode fix (
#8376
)
2024-09-12 11:11:57 -07:00
openai
Add output streaming support to multi-step + async while ensuring RequestOutput obj reuse (
#8335
)
2024-09-23 15:38:04 -07:00
__init__.py
[CI/Build] Move
test_utils.py
to
tests/utils.py
(
#4425
)
2024-05-13 23:50:09 +09:00
conftest.py
Support for guided decoding for offline LLM (
#6878
)
2024-08-04 03:12:09 +00:00
test_chat_utils.py
[Frontend] Multimodal support in offline chat (
#8098
)
2024-09-04 05:22:17 +00:00