|
core
|
Fix type hints (#1427)
|
2023-10-20 08:50:47 -07:00 |
|
engine
|
Support SqueezeLLM (#1326)
|
2023-10-21 23:14:59 -07:00 |
|
model_executor
|
Delay GPU->CPU sync in sampling (#1337)
|
2023-10-30 09:01:34 -07:00 |
|
worker
|
Delay GPU->CPU sync in sampling (#1337)
|
2023-10-30 09:01:34 -07:00 |
|
__init__.py
|
Bump up the version to v0.2.1 (#1355)
|
2023-10-16 12:58:57 -07:00 |
|
block.py
|
[Quality] Add code formatter and linter (#326)
|
2023-07-03 11:31:55 -07:00 |
|
config.py
|
Support SqueezeLLM (#1326)
|
2023-10-21 23:14:59 -07:00 |
|
logger.py
|
[Quality] Add code formatter and linter (#326)
|
2023-07-03 11:31:55 -07:00 |
|
sampling_params.py
|
Support repetition_penalty (#1424)
|
2023-10-29 10:02:41 -07:00 |