| csrc/stream_attn | ||
| bert_padding.py | ||
| README.md | ||
| rotary.py | ||
| stream_attn_interface.py | ||
| stream_blocksparse_attn_interface.py | ||
| streaming_attention.py | ||
| streaming_blocksparse_attention.py | ||
Alpha release of FlashAttention.
To compile:
cd csrc/stream_attn
python setup.py install
Interface: streaming_attention.py
Contact: trid@stanford.edu