Mention DeepSpeed inference in usage.md

2022-11-14 10:01:16 -08:00 · 2022-11-14 10:01:16 -08:00 · b0ed0a73fd
commit b0ed0a73fd
parent 25387b24c1
1 changed files with 3 additions and 0 deletions
--- a/usage.md
+++ b/usage.md
@ -15,6 +15,9 @@ PR or email us. We'd very much like to hear from you!
  [On-going](https://github.com/huggingface/transformers/pull/18439), blogpost
  coming soon.

+- Microsoft's [DeepSpeed](https://github.com/microsoft/DeepSpeed):
+  FlashAttention is [integrated](https://github.com/microsoft/DeepSpeed/blob/ec13da6ba7cabc44bb4745a64a208b8580792954/deepspeed/ops/transformer/inference/triton_ops.py) into DeepSpeed's inference engine.
+
 - MosaicML [Composer](https://github.com/mosaicml/composer)
  [library](https://www.mosaicml.com/blog/gpt-3-quality-for-500k). Composer is a
  library for efficient neural network training.