squall/flash-attention
flash-attention/csrc/flash_attn @ e07aa036db
BoxiangW | e07aa036db | Support flash attention 2 with causal masking when KV's seq length is longer than Q's seq length. (#436) | 2023-08-24 16:42:34 -07:00
..
src | Support flash attention 2 with causal masking when KV's seq length is longer than Q's seq length. (#436) | 2023-08-24 16:42:34 -07:00
flash_api.cpp | Enable CUDA graphs (#386) | 2023-07-27 16:11:34 -07:00