flash-attention/flash_attn/ops/triton
2024-09-11 10:39:58 -07:00
..
__init__.py Create __init__.py for ops/triton dir (#516) 2023-09-05 11:29:03 -07:00
cross_entropy.py [Rotary] Support qkv block layout from GQA 2024-09-11 10:39:58 -07:00
k_activations.py Run isort and black on python files 2023-08-18 14:22:11 -07:00
layer_norm.py [LayerNorm] Add option to write result to out and residual_out 2024-08-15 14:43:47 -07:00
linear.py [Rotary] Implement rotary in Triton 2023-09-03 02:51:58 -07:00
mlp.py Run isort and black on python files 2023-08-18 14:22:11 -07:00
rotary.py [Rotary] Support qkv block layout from GQA 2024-09-11 10:39:58 -07:00