This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
1
Packages
Projects
Releases
Wiki
Activity
4cc24f01b1
vllm
/
vllm
/
transformers_utils
/
configs
History
Abhinav Goyal
2416b26e11
[Speculative Decoding] Medusa Implementation with Top-1 proposer (
#4978
)
2024-07-09 18:34:02 -07:00
..
__init__.py
[Speculative Decoding] Medusa Implementation with Top-1 proposer (
#4978
)
2024-07-09 18:34:02 -07:00
arctic.py
[Model] Snowflake arctic model implementation (
#4652
)
2024-05-09 22:37:14 +00:00
chatglm.py
[Lora] Support long context lora (
#4787
)
2024-05-18 16:05:23 +09:00
dbrx.py
[CI] Disable non-lazy string operation on logging (
#4326
)
2024-04-26 00:16:58 -07:00
falcon.py
Add Falcon support (new) (
#592
)
2023-08-02 14:04:39 -07:00
jais.py
[Mypy] Part 3 fix typing for nested directories for most of directory (
#4161
)
2024-04-22 21:32:44 -07:00
medusa.py
[Speculative Decoding] Medusa Implementation with Top-1 proposer (
#4978
)
2024-07-09 18:34:02 -07:00
mlp_speculator.py
[Model] Changes to MLPSpeculator to support tie_weights and input_scale (
#5965
)
2024-07-01 16:40:02 -07:00
mpt.py
[CI] Try introducing isort. (
#3495
)
2024-03-25 07:59:47 -07:00