vllm/vllm
2023-11-18 21:57:07 -08:00
..
core Fix type hints (#1427) 2023-10-20 08:50:47 -07:00
engine [Minor] Fix duplication of ignored seq group in engine step (#1666) 2023-11-16 13:11:41 -08:00
entrypoints Support download models from www.modelscope.cn (#1588) 2023-11-17 20:38:31 -08:00
model_executor [Optimization] Implement fused add rmsnorm (#1667) 2023-11-18 18:18:02 -08:00
transformers_utils Revert MptConfig to MPTConfig (#1668) 2023-11-16 01:19:39 -08:00
worker Fix input_metadata.selected_token_indices in worker prepare_inputs (#1546) 2023-11-08 14:19:12 -08:00
__init__.py Bump up to v0.2.2 (#1689) 2023-11-18 21:57:07 -08:00
block.py [Quality] Add code formatter and linter (#326) 2023-07-03 11:31:55 -07:00
config.py Fix warning msg on quantization (#1715) 2023-11-18 21:49:55 -08:00
logger.py [Fix] Fix duplicated logging messages (#1524) 2023-10-31 09:04:47 -07:00
outputs.py docs: add description (#1553) 2023-11-03 09:14:52 -07:00
py.typed Add py.typed so consumers of vLLM can get type checking (#1509) 2023-10-30 14:50:47 -07:00
sampling_params.py Support Min P Sampler (#1642) 2023-11-17 16:20:49 -08:00
sequence.py [BugFix] Define __eq__ in SequenceGroupOutputs (#1389) 2023-10-17 01:09:44 -07:00
utils.py Allocate more shared memory to attention kernel (#1154) 2023-09-26 22:27:13 -07:00