vllm/model_executor at 19682023b62c7ed00cee52a805dfa279dfc9c7a2 - vllm

Flávia Béo aa9078fa03 Adds method to read the pooling types from model's files (#9506) Signed-off-by: Flavia Beo <flavia.beo@ibm.com> Signed-off-by: Max de Bayser <mbayser@br.ibm.com> Co-authored-by: Max de Bayser <mbayser@br.ibm.com>	2024-11-07 08:42:40 +00:00
__init__.py	[CI/Build] Move `test_utils.py` to `tests/utils.py` (#4425)	2024-05-13 23:50:09 +09:00
conftest.py	[Frontend][Core] Move guided decoding params into sampling params (#8252)	2024-10-01 09:34:25 +08:00
test_enabled_custom_ops.py	[torch.compile] Fine-grained CustomOp enabling mechanism (#9300)	2024-10-17 18:36:37 +00:00
test_guided_processors.py	[Frontend][Core] Move guided decoding params into sampling params (#8252)	2024-10-01 09:34:25 +08:00
test_model_load_with_params.py	Adds method to read the pooling types from model's files (#9506)	2024-11-07 08:42:40 +00:00
weight_utils.py	[Core] Support offline use of local cache for models (#4374)	2024-04-27 09:59:55 -07:00