vllm/vllm/model_executor
Latest commit: a62de9ecfd by Antoni Baum, 2023-09-09 14:58:35 -07:00
Fix wrong dtype in PagedAttentionWithALiBi bias (#996)
Signed-off-by: Antoni Baum <antoni.baum@protonmail.com>
Name                Last commit                                               Date
layers              Fix wrong dtype in PagedAttentionWithALiBi bias (#996)   2023-09-09 14:58:35 -07:00
models              Enable safetensors loading for all models (#974)         2023-09-07 15:49:52 -07:00
parallel_utils      Add Falcon support (new) (#592)                          2023-08-02 14:04:39 -07:00
__init__.py         [Quality] Add code formatter and linter (#326)           2023-07-03 11:31:55 -07:00
input_metadata.py   Add support for BLOOM (#331)                             2023-07-03 13:12:35 -07:00
model_loader.py     Enable safetensors loading for all models (#974)         2023-09-07 15:49:52 -07:00
utils.py            Change the name to vLLM (#150)                           2023-06-17 03:07:40 -07:00
weight_utils.py     Enable safetensors loading for all models (#974)         2023-09-07 15:49:52 -07:00
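The headline commit above corrects the dtype used when constructing the ALiBi attention bias. As a rough, hedged illustration of the underlying issue (not the actual vLLM implementation), the sketch below builds a per-head ALiBi bias and casts it to the model's compute dtype so it matches the attention scores it is added to; the helper name, tensor shapes, and slope values are hypothetical.

    import torch

    def build_alibi_bias(slopes: torch.Tensor, seq_len: int, dtype: torch.dtype) -> torch.Tensor:
        # Relative key positions, shape (1, seq_len).
        distances = torch.arange(seq_len)[None, :]
        # Scale the distances by each head's slope, shape (num_heads, 1, seq_len).
        bias = distances * slopes[:, None, None]
        # Cast to the model dtype (e.g. torch.float16) instead of leaving the
        # bias in the default float32, so it matches the attention scores.
        return bias.to(dtype)

    # Hypothetical usage with four heads and a sequence length of 8.
    slopes = torch.tensor([0.5, 0.25, 0.125, 0.0625])
    bias = build_alibi_bias(slopes, seq_len=8, dtype=torch.float16)
    print(bias.shape, bias.dtype)  # torch.Size([4, 1, 8]) torch.float16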