vllm/tests/tpu
Alexander Matveev 7c7714d856
[Core][Bugfix][Perf] Introduce MQLLMEngine to avoid asyncio OH (#8157)
Co-authored-by: Nick Hill <nickhill@us.ibm.com>
Co-authored-by: rshaw@neuralmagic.com <rshaw@neuralmagic.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-neuralmagic@users.noreply.github.com>
Co-authored-by: Simon Mo <simon.mo@hey.com>
2024-09-18 13:56:58 +00:00
..
__init__.py [torch.compile] avoid Dynamo guard evaluation overhead (#7898) 2024-08-28 16:10:12 -07:00
test_compilation.py [torch.compile] remove reset (#7975) 2024-08-28 17:32:26 -07:00
test_custom_dispatcher.py [Core][Bugfix][Perf] Introduce MQLLMEngine to avoid asyncio OH (#8157) 2024-09-18 13:56:58 +00:00