vllm/tests/model_executor
Flávia Béo aa9078fa03
Adds method to read the pooling types from model's files (#9506)
Signed-off-by: Flavia Beo <flavia.beo@ibm.com>
Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
Co-authored-by: Max de Bayser <mbayser@br.ibm.com>
2024-11-07 08:42:40 +00:00
..
__init__.py [CI/Build] Move test_utils.py to tests/utils.py (#4425) 2024-05-13 23:50:09 +09:00
conftest.py [Frontend][Core] Move guided decoding params into sampling params (#8252) 2024-10-01 09:34:25 +08:00
test_enabled_custom_ops.py [torch.compile] Fine-grained CustomOp enabling mechanism (#9300) 2024-10-17 18:36:37 +00:00
test_guided_processors.py [Frontend][Core] Move guided decoding params into sampling params (#8252) 2024-10-01 09:34:25 +08:00
test_model_load_with_params.py Adds method to read the pooling types from model's files (#9506) 2024-11-07 08:42:40 +00:00
weight_utils.py [Core] Support offline use of local cache for models (#4374) 2024-04-27 09:59:55 -07:00