vllm/vllm/spec_decode
2024-06-15 04:45:31 +00:00
__init__.py [Bugfix] Add __init__.py files for vllm/core/block/ and vllm/spec_decode/ (#3798) 2024-04-02 12:35:31 -07:00
batch_expansion.py [Misc] Various simplifications and typing fixes (#5368) 2024-06-11 10:29:02 +08:00
interfaces.py [Speculative Decoding] Add ProposerWorkerBase abstract class (#5252) 2024-06-05 14:53:05 -07:00
metrics.py [Speculative decoding 7/9] Speculative decoding end-to-end correctness tests. (#3951) 2024-04-23 08:02:36 +00:00
multi_step_worker.py [mypy] Enable type checking for test directory (#5017) 2024-06-15 04:45:31 +00:00
ngram_worker.py [mypy] Enable type checking for test directory (#5017) 2024-06-15 04:45:31 +00:00
proposer_worker_base.py [Speculative Decoding] Add ProposerWorkerBase abstract class (#5252) 2024-06-05 14:53:05 -07:00
spec_decode_worker.py [mypy] Enable type checking for test directory (#5017) 2024-06-15 04:45:31 +00:00
top1_proposer.py [Misc] Various simplifications and typing fixes (#5368) 2024-06-11 10:29:02 +08:00
util.py [mypy] Enable type checking for test directory (#5017) 2024-06-15 04:45:31 +00:00