vllm/tpu at d4a2ac830291305f202a85e157bff3a07b58e616 - vllm

History

Alexander Matveev 7c7714d856 [Core][Bugfix][Perf] Introduce `MQLLMEngine` to avoid `asyncio` OH (#8157 ) Co-authored-by: Nick Hill <nickhill@us.ibm.com> Co-authored-by: rshaw@neuralmagic.com <rshaw@neuralmagic.com> Co-authored-by: Robert Shaw <114415538+robertgshaw2-neuralmagic@users.noreply.github.com> Co-authored-by: Simon Mo <simon.mo@hey.com>		2024-09-18 13:56:58 +00:00
..
__init__.py	[torch.compile] avoid Dynamo guard evaluation overhead (#7898 )	2024-08-28 16:10:12 -07:00
test_compilation.py	[torch.compile] remove reset (#7975 )	2024-08-28 17:32:26 -07:00
test_custom_dispatcher.py	[Core][Bugfix][Perf] Introduce `MQLLMEngine` to avoid `asyncio` OH (#8157 )	2024-09-18 13:56:58 +00:00