vllm/worker at 30a2e8074246e11a1452ab5e84a7be65ecac6119 - vllm

History

wangshuai09 3ddbe25502 [Hardware][CPU] using current_platform.is_cpu (#9536 )		2024-10-22 00:50:43 -07:00
..
__init__.py	[Speculative decoding 2/9] Multi-step worker for draft model (#2424 )	2024-01-21 16:31:47 -08:00
test_encoder_decoder_model_runner.py	[Hardware][CPU] using current_platform.is_cpu (#9536 )	2024-10-22 00:50:43 -07:00
test_model_input.py	[Core] Add `AttentionState` abstraction (#7663 )	2024-08-20 18:50:45 +00:00
test_model_runner.py	[Core] Factor out common code in `SequenceData` and `Sequence` (#8675 )	2024-09-21 02:30:39 +00:00
test_profile.py	🐛 fix torch memory profiling (#9516 )	2024-10-18 21:25:19 -04:00
test_swap.py	[Core] Pipeline Parallel Support (#4412 )	2024-07-02 10:58:08 -07:00