From 450b64fe4447d044877a58958a6b08b9b82d21dc Mon Sep 17 00:00:00 2001
From: Tri Dao
Date: Mon, 27 Jun 2022 13:50:16 -0700
Subject: [PATCH] Add README section on issues

---
 README.md | 10 ++++++++++
 1 file changed, 10 insertions(+)

diff --git a/README.md b/README.md
index 4435de1..8d46d8f 100644
--- a/README.md
+++ b/README.md
@@ -104,6 +104,16 @@ T4 GPUs are commonly used for inference, so we also measure speedup on the forwa
 We see speedups between 2.5x-4.5x on the forward pass.
 
+## When you encounter issues
+
+This alpha release of FlashAttention contains code written for a research
+project to validate ideas on speeding up attention.
+We have tested it on several models (BERT, GPT2, ViT).
+However, there might still be bugs in the implementation that we hope to iron
+out in the next few months.
+
+If you encounter any bugs, please open a GitHub issue!
+
 ## Acknowledgments
 Our implementation uses Apex's [FMHA](https://github.com/NVIDIA/apex/tree/master/apex/contrib/csrc/fmha) code