From b78f5a392d3232cf7a220b8cb6e1a58b71d8f344 Mon Sep 17 00:00:00 2001
From: Tri Dao <tridpq@gmail.com>
Date: Thu, 15 Dec 2022 19:49:04 -0800
Subject: [PATCH] [Docs] Mention Megatron-LM

---
 usage.md | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/usage.md b/usage.md
index 5b6f24c..0d9c052 100644
--- a/usage.md
+++ b/usage.md
@@ -18,6 +18,9 @@ PR or email us. We'd very much like to hear from you!
 - Microsoft's [DeepSpeed](https://github.com/microsoft/DeepSpeed):
   FlashAttention is [integrated](https://github.com/microsoft/DeepSpeed/blob/ec13da6ba7cabc44bb4745a64a208b8580792954/deepspeed/ops/transformer/inference/triton_ops.py) into DeepSpeed's inference engine.
 
+- Nvidia's [Megatron-LM](https://github.com/NVIDIA/Megatron-LM/pull/267). This
+  library is a popular framework on training large transformer language models at scale.
+
 - MosaicML [Composer](https://github.com/mosaicml/composer)
   [library](https://www.mosaicml.com/blog/gpt-3-quality-for-500k). Composer is a
   library for efficient neural network training.