vllm/vllm/entrypoints
Joe Runde 062c89e7c9
[Frontend][Core] Move guided decoding params into sampling params (#8252)
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Co-authored-by: Nick Hill <nickhill@us.ibm.com>
2024-10-01 09:34:25 +08:00
..
openai [Frontend][Core] Move guided decoding params into sampling params (#8252) 2024-10-01 09:34:25 +08:00
__init__.py Change the name to vLLM (#150) 2023-06-17 03:07:40 -07:00
api_server.py [Bugfix] Config got an unexpected keyword argument 'engine' (#8556) 2024-09-20 14:00:45 -07:00
chat_utils.py [Frontend] Added support for HF's new continue_final_message parameter (#8942) 2024-09-29 17:59:47 +00:00
launcher.py [Core][Bugfix][Perf] Introduce MQLLMEngine to avoid asyncio OH (#8157) 2024-09-18 13:56:58 +00:00
llm.py [Frontend][Core] Move guided decoding params into sampling params (#8252) 2024-10-01 09:34:25 +08:00
logger.py [Frontend] Refactor prompt processing (#4028) 2024-07-22 10:13:53 -07:00