This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
1
Packages
Projects
Releases
Wiki
Activity
62b8aebc6f
vllm
/
vllm
/
spec_decode
History
Cade Daniel
62b8aebc6f
[Speculative decoding 7/9] Speculative decoding end-to-end correctness tests. (
#3951
)
2024-04-23 08:02:36 +00:00
..
__init__.py
[Bugfix] Add
__init__.py
files for
vllm/core/block/
and
vllm/spec_decode/
(
#3798
)
2024-04-02 12:35:31 -07:00
batch_expansion.py
[Speculative decoding 7/9] Speculative decoding end-to-end correctness tests. (
#3951
)
2024-04-23 08:02:36 +00:00
interfaces.py
[Speculative decoding 7/9] Speculative decoding end-to-end correctness tests. (
#3951
)
2024-04-23 08:02:36 +00:00
metrics.py
[Speculative decoding 7/9] Speculative decoding end-to-end correctness tests. (
#3951
)
2024-04-23 08:02:36 +00:00
multi_step_worker.py
[Speculative decoding 7/9] Speculative decoding end-to-end correctness tests. (
#3951
)
2024-04-23 08:02:36 +00:00
spec_decode_worker.py
[Speculative decoding 7/9] Speculative decoding end-to-end correctness tests. (
#3951
)
2024-04-23 08:02:36 +00:00
util.py
[Speculative decoding 6/9] Integrate speculative decoding with LLMEngine (
#3894
)
2024-04-16 13:09:21 -07:00