flash-attention/tests
Grigory Sizov f816dee63c
Support unpadded LSE layout (#970)
* Support unpadded LSE layout.

Co-authored-by: Xinfeng Xie <xfxie.ceca@gmail.com>
Co-authored-by: Jianyu Huang <hjyahead@gmail.com>

* Cleanup

* Fix unpadded LSE on split-kv path

* Fix formatting and comments

* Fix inline vs forceinline

---------

Co-authored-by: Xinfeng Xie <xfxie.ceca@gmail.com>
Co-authored-by: Jianyu Huang <hjyahead@gmail.com>
2024-06-27 02:38:13 -07:00
..
layers Run isort and black on test files 2023-08-18 20:59:35 -07:00
losses return z_loss (#768) 2024-01-21 15:23:41 -08:00
models Add test for BTLM init 2023-12-25 15:16:27 -08:00
modules Run isort and black on test files 2023-08-18 20:59:35 -07:00
ops [LayerNorm] Rename layernorm.py -> layer_norm.py 2024-01-05 00:21:03 -08:00
pyproject.toml Move pyproject.toml to flash-attn and tests dir to avoid PEP 517 2023-08-25 15:05:28 -07:00
test_flash_attn.py Support unpadded LSE layout (#970) 2024-06-27 02:38:13 -07:00
test_rotary.py Fix spurious re-compilations of rotary_kernel (#911) 2024-04-05 13:40:41 -07:00