squall/flash-attention
flash-attention/csrc/flash_attn at commit 36bc29edf7
Latest commit: 36bc29edf7 by Tri Dao, 2024-01-22 22:39:29 -08:00: Use int64_t instead of uint32_t in kernel_traits.h
src: Use int64_t instead of uint32_t in kernel_traits.h, 2024-01-22 22:39:29 -08:00
flash_api.cpp: Add split-kv and M<->H swap to varlen forward decoding attention (#754), 2024-01-21 15:28:36 -08:00