vllm/worker at c0c2335ce027486d254c31f665ce00d7db427d22 - vllm

History

Kunshang Ji 96b6f475dd Remove hardcoded `device="cuda"` to support more devices (#2503 ) Co-authored-by: Jiang Li <jiang1.li@intel.com> Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>		2024-02-01 15:46:39 -08:00
..
spec_decode	Remove hardcoded `device="cuda"` to support more devices (#2503 )	2024-02-01 15:46:39 -08:00
__init__.py	[Speculative decoding 2/9] Multi-step worker for draft model (#2424 )	2024-01-21 16:31:47 -08:00
test_model_runner.py	Remove hardcoded `device="cuda"` to support more devices (#2503 )	2024-02-01 15:46:39 -08:00