This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
flash-attention
Watch
1
Star
0
Fork
0
You've already forked flash-attention
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
2e29dacf0c
flash-attention
/
tests
History
Tri Dao
3f7d5786ba
Pass alibi slopes to flash_attn_with_kvcache during generation
2023-12-24 20:31:59 -08:00
..
layers
Run isort and black on test files
2023-08-18 20:59:35 -07:00
losses
[CrossEntropy] Test longer sequences
2023-12-16 19:11:23 -08:00
models
Pass alibi slopes to flash_attn_with_kvcache during generation
2023-12-24 20:31:59 -08:00
modules
Run isort and black on test files
2023-08-18 20:59:35 -07:00
ops
[LayerNorm] Implement dropout in fused residual + LN/RMSNorm
2023-12-19 16:26:07 -08:00
pyproject.toml
Move pyproject.toml to flash-attn and tests dir to avoid PEP 517
2023-08-25 15:05:28 -07:00
test_flash_attn.py
Implement deterministic backward (thanks to Meituan)
2023-12-23 17:57:36 -08:00
test_rotary.py
[Rotary] Implement varlen rotary
2023-09-03 17:57:10 -07:00