vllm/vllm/model_executor/layers
Antoni Baum a62de9ecfd
Fix wrong dtype in PagedAttentionWithALiBi bias (#996)
Signed-off-by: Antoni Baum <antoni.baum@protonmail.com>
2023-09-09 14:58:35 -07:00
__init__.py    Change the name to vLLM (#150)                                   2023-06-17 03:07:40 -07:00
activation.py  Implement approximate GELU kernels (#828)                        2023-08-23 07:43:21 +09:00
attention.py   Fix wrong dtype in PagedAttentionWithALiBi bias (#996)           2023-09-09 14:58:35 -07:00
layernorm.py   Change the name to vLLM (#150)                                   2023-06-17 03:07:40 -07:00
sampler.py     Align vLLM's beam search implementation with HF generate (#857)  2023-09-04 17:29:42 -07:00