This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
1
Packages
Projects
Releases
Wiki
Activity
f746ced08d
vllm
/
cacheflow
/
model_executor
History
Woosuk Kwon
f746ced08d
Implement stop strings and best_of (
#114
)
2023-05-21 11:18:00 -07:00
..
layers
Implement stop strings and best_of (
#114
)
2023-05-21 11:18:00 -07:00
models
Use runtime profiling to replace manual memory analyzers (
#81
)
2023-05-19 11:35:44 -06:00
parallel_utils
Remove unused parts in Megatron-LM code and add copyright notice (
#110
)
2023-05-20 09:11:34 -06:00
__init__.py
Refactor system architecture (
#109
)
2023-05-20 13:06:59 -07:00
input_metadata.py
Implement presence and frequency penalties (
#95
)
2023-05-10 23:39:12 -07:00
model_loader.py
Refactor system architecture (
#109
)
2023-05-20 13:06:59 -07:00
utils.py
Refactor system architecture (
#109
)
2023-05-20 13:06:59 -07:00
weight_utils.py
Add docstrings to some modules and classes (
#100
)
2023-05-14 22:32:38 -07:00