This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
1
Packages
Projects
Releases
Wiki
Activity
7297fa6f7c
vllm
/
cacheflow
/
model_executor
History
Zhuohan Li
7297fa6f7c
Remove unused parts in Megatron-LM code and add copyright notice (
#110
)
2023-05-20 09:11:34 -06:00
..
layers
Use runtime profiling to replace manual memory analyzers (
#81
)
2023-05-19 11:35:44 -06:00
models
Use runtime profiling to replace manual memory analyzers (
#81
)
2023-05-19 11:35:44 -06:00
parallel_utils
Remove unused parts in Megatron-LM code and add copyright notice (
#110
)
2023-05-20 09:11:34 -06:00
__init__.py
Use runtime profiling to replace manual memory analyzers (
#81
)
2023-05-19 11:35:44 -06:00
input_metadata.py
Implement presence and frequency penalties (
#95
)
2023-05-10 23:39:12 -07:00
model_loader.py
Use runtime profiling to replace manual memory analyzers (
#81
)
2023-05-19 11:35:44 -06:00
utils.py
Use runtime profiling to replace manual memory analyzers (
#81
)
2023-05-19 11:35:44 -06:00
weight_utils.py
Add docstrings to some modules and classes (
#100
)
2023-05-14 22:32:38 -07:00