vllm/e2e at 85657b56071b7c21586d88389c6e817f11c69e04 - vllm

History

Nick Hill 99dac099ab [Core][Doc] Default to multiprocessing for single-node distributed case (#5230 ) Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>		2024-06-11 11:10:41 -07:00
..
__init__.py	[Speculative decoding 7/9] Speculative decoding end-to-end correctness tests. (#3951 )	2024-04-23 08:02:36 +00:00
conftest.py	[Core][Doc] Default to multiprocessing for single-node distributed case (#5230 )	2024-06-11 11:10:41 -07:00
test_compatibility.py	[Speculative decoding][Re-take] Enable TP>1 speculative decoding (#4840 )	2024-05-16 00:53:51 -07:00
test_integration_dist.py	[Speculative decoding][Re-take] Enable TP>1 speculative decoding (#4840 )	2024-05-16 00:53:51 -07:00
test_integration.py	[Speculative decoding][Re-take] Enable TP>1 speculative decoding (#4840 )	2024-05-16 00:53:51 -07:00
test_logprobs.py	[Speculative decoding] Support target-model logprobs (#4378 )	2024-05-03 15:52:01 -07:00
test_multistep_correctness.py	[Speculative decoding][Re-take] Enable TP>1 speculative decoding (#4840 )	2024-05-16 00:53:51 -07:00
test_ngram_correctness.py	[Dynamic Spec Decoding] Minor fix for disabling speculative decoding (#5000 )	2024-05-25 10:00:14 -07:00