This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
1
Packages
Projects
Releases
Wiki
Activity
d6545ad22e
vllm
/
tests
History
Woosuk Kwon
e67b4f2c2a
Use FP32 in RoPE initialization (
#1004
)
...
Co-authored-by: One <imone@tuta.io>
2023-09-11 00:26:35 -07:00
..
async_engine
Start background task in
AsyncLLMEngine.generate
(
#988
)
2023-09-08 00:03:39 -07:00
kernels
Use FP32 in RoPE initialization (
#1004
)
2023-09-11 00:26:35 -07:00
models
Add tests for models (
#922
)
2023-09-01 11:19:43 +09:00
samplers
Align vLLM's beam search implementation with HF generate (
#857
)
2023-09-04 17:29:42 -07:00
conftest.py
Use queue for finished requests (
#957
)
2023-09-05 19:27:23 -07:00