This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
flash-attention
Watch
1
Star
0
Fork
0
You've already forked flash-attention
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
6a77a6da10
flash-attention
/
csrc
/
flash_attn
History
Tri Dao
6a77a6da10
Refactor gemm_cl to template on either __half or __nv_bfloat16
2022-07-09 23:18:26 -07:00
..
cutlass
@
319a389f42
Add Cutlass as submodule
2022-06-02 09:54:16 -07:00
src
Refactor gemm_cl to template on either __half or __nv_bfloat16
2022-07-09 23:18:26 -07:00
fmha_api.cpp
Apply dropout scaling to dQ and dK instead of to V (in bwd)
2022-07-03 17:53:37 -07:00