flash-attention/flash_attn/ops/triton
File              Last commit             Date
k_activations.py  Add GPT and ViT models  2022-11-13 22:30:23 -08:00
linear.py         Add GPT and ViT models  2022-11-13 22:30:23 -08:00
mlp.py            Implement LLaMa         2023-04-18 21:51:35 -07:00