squall/vllm
vllm/vllm/model_executor @ d189170b6c
Wang Ran (汪然)  d189170b6c  remove useless statements (#1408)  2023-10-20 08:52:07 -07:00
layers              Change scheduler & input tensor shape (#1381)  2023-10-16 17:48:42 -07:00
models              Fix the issue for AquilaChat2-* models (#1339)  2023-10-13 11:51:29 -07:00
parallel_utils      Implement prompt logprobs & Batched topk for computing logprobs (#1328)  2023-10-16 10:56:50 -07:00
quantization_utils  workaround of AWQ for Turing GPUs (#1252)  2023-10-10 19:48:16 -07:00
__init__.py         [Quality] Add code formatter and linter (#326)  2023-07-03 11:31:55 -07:00
input_metadata.py   remove useless statements (#1408)  2023-10-20 08:52:07 -07:00
model_loader.py     Fix the issue for AquilaChat2-* models (#1339)  2023-10-13 11:51:29 -07:00
utils.py            TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181)  2023-10-02 15:36:09 -07:00
weight_utils.py     Add blacklist in model checkpoint (#1325)  2023-10-12 01:05:37 -07:00