This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
flash-attention
Watch
1
Star
0
Fork
0
You've already forked flash-attention
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
d79f9b41a8
flash-attention
/
flash_attn
/
ops
/
triton
History
Tri Dao
d79f9b41a8
[CrossEntropy] Use online softmax to simplify implementation
2024-08-24 17:40:39 -07:00
..
__init__.py
Create __init__.py for ops/triton dir (
#516
)
2023-09-05 11:29:03 -07:00
cross_entropy.py
[CrossEntropy] Use online softmax to simplify implementation
2024-08-24 17:40:39 -07:00
k_activations.py
Run isort and black on python files
2023-08-18 14:22:11 -07:00
layer_norm.py
[LayerNorm] Add option to write result to out and residual_out
2024-08-15 14:43:47 -07:00
linear.py
[Rotary] Implement rotary in Triton
2023-09-03 02:51:58 -07:00
mlp.py
Run isort and black on python files
2023-08-18 14:22:11 -07:00
rotary.py
Fix spurious re-compilations of
rotary_kernel
(
#911
)
2024-04-05 13:40:41 -07:00