This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
1
Packages
Projects
Releases
Wiki
Activity
667
Commits
1
Branch
0
Tags
24
MiB
18bfcdd05c
Commit Graph
3 Commits
Author
SHA1
Message
Date
shiyi.c_98
d10f8e1d43
[Experimental] Prefix Caching Support (
#1669
)
...
Co-authored-by: DouHappy <2278958187@qq.com> Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
2024-01-17 16:32:10 -08:00
Zhuohan Li
fd4ea8ef5c
Use NCCL instead of ray for control-plane communication to remove serialization overhead (
#2221
)
2024-01-03 11:30:22 -08:00
Woosuk Kwon
cd3aa153a4
Fix broken worker test (
#1900
)
2023-12-02 22:17:33 -08:00