vllm/cacheflow
2023-06-17 17:25:21 +08:00
..
core Rename servers to engines (#152) 2023-06-17 17:25:21 +08:00
engine Rename servers to engines (#152) 2023-06-17 17:25:21 +08:00
entrypoints Rename servers to engines (#152) 2023-06-17 17:25:21 +08:00
model_executor Remove redundant code in ColumnParallelLinear (#146) 2023-06-10 21:25:11 -07:00
worker [PyPI] Packaging for PyPI distribution (#140) 2023-06-05 20:03:14 -07:00
__init__.py Rename servers to engines (#152) 2023-06-17 17:25:21 +08:00
block.py Add docstrings to some modules and classes (#100) 2023-05-14 22:32:38 -07:00
config.py Add docstrings for LLMServer and related classes and examples (#142) 2023-06-07 18:25:20 +08:00
logger.py Add a system logger (#85) 2023-05-08 23:03:35 -07:00
outputs.py OpenAI Compatible Frontend (#116) 2023-05-23 21:39:50 -07:00
sampling_params.py Enable LLaMA fast tokenizer (#132) 2023-05-28 02:51:42 -07:00
sequence.py Fix various issues of async servers (#135) 2023-06-05 23:44:50 +08:00
utils.py OpenAI Compatible Frontend (#116) 2023-05-23 21:39:50 -07:00