vllm/vllm at c5f7740d89737744438e08c26da1d4fbadcb3893 - vllm

History

Woosuk Kwon c5f7740d89 Bump up to v0.2.2 (#1689 )		2023-11-18 21:57:07 -08:00
..
core	Fix type hints (#1427 )	2023-10-20 08:50:47 -07:00
engine	[Minor] Fix duplication of ignored seq group in engine step (#1666 )	2023-11-16 13:11:41 -08:00
entrypoints	Support download models from www.modelscope.cn (#1588 )	2023-11-17 20:38:31 -08:00
model_executor	[Optimization] Implement fused add rmsnorm (#1667 )	2023-11-18 18:18:02 -08:00
transformers_utils	Revert `MptConfig` to `MPTConfig` (#1668 )	2023-11-16 01:19:39 -08:00
worker	Fix input_metadata.selected_token_indices in worker prepare_inputs (#1546 )	2023-11-08 14:19:12 -08:00
__init__.py	Bump up to v0.2.2 (#1689 )	2023-11-18 21:57:07 -08:00
block.py	[Quality] Add code formatter and linter (#326 )	2023-07-03 11:31:55 -07:00
config.py	Fix warning msg on quantization (#1715 )	2023-11-18 21:49:55 -08:00
logger.py	[Fix] Fix duplicated logging messages (#1524 )	2023-10-31 09:04:47 -07:00
outputs.py	docs: add description (#1553 )	2023-11-03 09:14:52 -07:00
py.typed	Add py.typed so consumers of vLLM can get type checking (#1509 )	2023-10-30 14:50:47 -07:00
sampling_params.py	Support Min P Sampler (#1642 )	2023-11-17 16:20:49 -08:00
sequence.py	[BugFix] Define `__eq__` in SequenceGroupOutputs (#1389 )	2023-10-17 01:09:44 -07:00
utils.py	Allocate more shared memory to attention kernel (#1154 )	2023-09-26 22:27:13 -07:00