vllm/vllm/attention
Michał Moskal 32881f3f31
[kernel] fix sliding window in prefix prefill Triton kernel (#4405)
Co-authored-by: SangBin Cho <rkooo567@gmail.com>
2024-05-02 11:23:37 -07:00
..
backends [kernel] fix sliding window in prefix prefill Triton kernel (#4405) 2024-05-02 11:23:37 -07:00
ops [kernel] fix sliding window in prefix prefill Triton kernel (#4405) 2024-05-02 11:23:37 -07:00
__init__.py [Core][5/N] Fully working chunked prefill e2e (#3884) 2024-04-10 17:56:48 -07:00
layer.py [Misc]Add customized information for models (#4132) 2024-04-30 21:18:14 -07:00
selector.py [Misc] centralize all usage of environment variables (#4548) 2024-05-02 11:13:25 -07:00