flash-attention/flash_attn/ops/triton
2024-08-24 17:40:39 -07:00
__init__.py       Create __init__.py for ops/triton dir (#516)                    2023-09-05 11:29:03 -07:00
cross_entropy.py  [CrossEntropy] Use online softmax to simplify implementation    2024-08-24 17:40:39 -07:00
k_activations.py  Run isort and black on python files                             2023-08-18 14:22:11 -07:00
layer_norm.py     [LayerNorm] Add option to write result to out and residual_out  2024-08-15 14:43:47 -07:00
linear.py         [Rotary] Implement rotary in Triton                             2023-09-03 02:51:58 -07:00
mlp.py            Run isort and black on python files                             2023-08-18 14:22:11 -07:00
rotary.py         Fix spurious re-compilations of rotary_kernel (#911)            2024-04-05 13:40:41 -07:00