flash-attention/flash_attn/ops
GAOXinyu 0cb595ad94
[bugfix] handle_x not define when using checkpoint_lvl = 2 (#502)
When using checkpoint_lvl=2, x is gathered with all_gather_raw(x) without async_op=True,
so there is no handle to wait on. The wait is simply skipped in that case.
2023-08-29 23:46:10 -07:00
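
The failure mode described in the commit message can be illustrated with a small toy model. This is only a sketch under stated assumptions, not the code in fused_dense.py: FakeHandle, fake_all_gather_raw, backward_sketch, and the skip_wait_for_lvl2 flag are hypothetical stand-ins invented for illustration, and no real distributed call is made. It assumes the gather returns an (output, handle) pair and that the checkpoint_lvl=2 branch discards the handle.

```python
# Toy model only: these names are hypothetical stand-ins, not the
# flash_attn or torch.distributed APIs.


class FakeHandle:
    """Stands in for the work handle of an asynchronous collective."""

    def __init__(self):
        self.waited = False

    def wait(self):
        self.waited = True


def fake_all_gather_raw(x, async_op=False):
    """Return (gathered, handle); handle is None for a synchronous call."""
    gathered = list(x) * 2  # pretend two ranks hold identical shards
    return gathered, (FakeHandle() if async_op else None)


def backward_sketch(x, checkpoint_lvl, skip_wait_for_lvl2=True):
    if checkpoint_lvl in (0, 1):
        # Asynchronous gather: a handle is bound and must be waited on.
        total_x, handle_x = fake_all_gather_raw(x, async_op=True)
    else:
        # checkpoint_lvl == 2: synchronous gather, handle_x is never bound.
        total_x, _ = fake_all_gather_raw(x)

    # Waiting unconditionally would reference the unbound name handle_x
    # when checkpoint_lvl == 2, so the wait is skipped in that case.
    if checkpoint_lvl != 2 or not skip_wait_for_lvl2:
        handle_x.wait()
    return total_x
```

Calling backward_sketch(x, checkpoint_lvl=2, skip_wait_for_lvl2=False) in this toy model raises UnboundLocalError, which mirrors the "handle_x not define" symptom in the commit title. With the default flag, the wait is skipped and the gathered value is returned.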
triton Run isort and black on python files 2023-08-18 14:22:11 -07:00
__init__.py Add __init__.py files to subdirectories for installation 2022-11-17 16:55:44 -08:00
activations.py Run isort and black on python files 2023-08-18 14:22:11 -07:00
fused_dense.py [bugfix] handle_x not define when using checkpoint_lvl = 2 (#502) 2023-08-29 23:46:10 -07:00
layer_norm.py Run isort and black on python files 2023-08-18 14:22:11 -07:00
rms_norm.py Run isort and black on python files 2023-08-18 14:22:11 -07:00