Merge pull request #313 from philipturner/patch-1

Metal FlashAttention
This commit is contained in:
Tri Dao 2023-07-15 20:36:48 -04:00 committed by GitHub
commit 01c40dacc4
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23

View File

@ -123,3 +123,5 @@ yields the fastest BERT training on cloud instances in MLPerf training 2.0 (June
- [Jax](https://github.com/google/jax): an [implementation](https://github.com/lucidrains/flash-attention-jax)
in Jax by [lucidrains](https://github.com/lucidrains/).
- [Metal](https://developer.apple.com/metal): an [implementation](https://github.com/philipturner/metal-flash-attention) in Metal by Philip Turner. This ports FlashAttention to mobile GPU architectures such as Apple silicon.