cacheflow Implement single_query_cached_kv_attention kernel (#3) 2023-03-01 15:02:19 -08:00
csrc Implement single_query_cached_kv_attention kernel (#3) 2023-03-01 15:02:19 -08:00
tests/kernels Implement single_query_cached_kv_attention kernel (#3) 2023-03-01 15:02:19 -08:00
.gitignore Add gitignore 2023-02-16 07:47:21 +00:00
README.md Add README 2023-02-24 12:04:49 +00:00
server.py Clean up the server script 2023-02-24 11:56:21 +00:00
setup.py Implement single_query_cached_kv_attention kernel (#3) 2023-03-01 15:02:19 -08:00

CacheFlow

Installation

pip install cmake torch transformers
pip install -e .
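
After the two install commands finish, a quick import check can confirm the environment is ready before starting the server. The following is a minimal sketch, not part of the repository: the helper name `missing_packages` is hypothetical, the import names are assumed to match the pip package names above, and `cacheflow` is assumed to be the name of the package installed by `pip install -e .` (based on the `cacheflow` directory in the repository).

```python
import importlib.util

# Import names assumed from the pip command above; "cacheflow" is assumed
# from the repository's package directory and may differ in practice.
REQUIRED = ["cmake", "torch", "transformers", "cacheflow"]


def missing_packages(names):
    """Return the names that cannot be located on the current import path."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Not importable:", ", ".join(missing))
    else:
        print("All required packages are importable.")
```

This only checks that each package can be located; it does not import them, so it will not catch a compiled extension that fails to load at import time.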

Run

python server.py