This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
1
Packages
Projects
Releases
Wiki
Activity
cf5cb1e33e
vllm
/
tests
History
Antoni Baum
cf5cb1e33e
Allocate more shared memory to attention kernel (
#1154
)
2023-09-26 22:27:13 -07:00
..
async_engine
Remove AsyncLLMEngine busy loop, shield background task (
#1059
)
2023-09-17 00:29:08 -07:00
engine
Fix detokenization leaving special tokens (
#1044
)
2023-09-14 16:37:03 -07:00
kernels
Allocate more shared memory to attention kernel (
#1154
)
2023-09-26 22:27:13 -07:00
models
Add tests for models (
#922
)
2023-09-01 11:19:43 +09:00
samplers
[Sampler] Vectorized sampling (simplified) (
#1048
)
2023-09-22 17:48:04 -07:00
conftest.py
Use queue for finished requests (
#957
)
2023-09-05 19:27:23 -07:00