vllm/model_executor at d189170b6c5a143e493c3f5cb7e8c976e8da62c7 - vllm

History

Wang Ran (汪然) d189170b6c remove useless statements (#1408 )		2023-10-20 08:52:07 -07:00
..
layers	Change scheduler & input tensor shape (#1381 )	2023-10-16 17:48:42 -07:00
models	Fix the issue for AquilaChat2-* models (#1339 )	2023-10-13 11:51:29 -07:00
parallel_utils	Implement prompt logprobs & Batched topk for computing logprobs (#1328 )	2023-10-16 10:56:50 -07:00
quantization_utils	workaround of AWQ for Turing GPUs (#1252 )	2023-10-10 19:48:16 -07:00
__init__.py	[Quality] Add code formatter and linter (#326 )	2023-07-03 11:31:55 -07:00
input_metadata.py	remove useless statements (#1408 )	2023-10-20 08:52:07 -07:00
model_loader.py	Fix the issue for AquilaChat2-* models (#1339 )	2023-10-13 11:51:29 -07:00
utils.py	TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181 )	2023-10-02 15:36:09 -07:00
weight_utils.py	Add blacklist in model checkpoint (#1325 )	2023-10-12 01:05:37 -07:00