flash-attention/examples/inference/README.md
2023-09-22 02:31:00 -07:00

49 B

Example of LLM inference using FlashAttention