vllm/tests/async_engine
Alexander Matveev 7c7714d856
[Core][Bugfix][Perf] Introduce MQLLMEngine to avoid asyncio OH (#8157)
Co-authored-by: Nick Hill <nickhill@us.ibm.com>
Co-authored-by: rshaw@neuralmagic.com <rshaw@neuralmagic.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-neuralmagic@users.noreply.github.com>
Co-authored-by: Simon Mo <simon.mo@hey.com>
2024-09-18 13:56:58 +00:00
..
__init__.py [CI/Build] Move test_utils.py to tests/utils.py (#4425) 2024-05-13 23:50:09 +09:00
api_server_async_engine.py [BugFix] Overhaul async request cancellation (#7111) 2024-08-07 13:21:41 +08:00
test_api_server.py [misc] remove engine_use_ray (#8126) 2024-09-11 18:23:36 -07:00
test_async_llm_engine.py [BugFix] Fix clean shutdown issues (#8492) 2024-09-16 09:33:46 -07:00
test_request_tracker.py [Core] Streamline stream termination in AsyncLLMEngine (#7336) 2024-08-09 07:06:36 +00:00