flash-attention

History

Tri Dao ef085cfcda [ViT] Fix extra norm_0, use new LN order in Block		2023-01-15 22:58:56 -08:00
..
__init__.py	Add __init__.py files to subdirectories for installation	2022-11-17 16:55:44 -08:00
block.py	[ViT] Fix extra norm_0, use new LN order in Block	2023-01-15 22:58:56 -08:00
embedding.py	Reorder LN in Block, support OPT	2023-01-15 22:14:31 -08:00
mha.py	[Gen] Make generation work with Tensor Parallel	2023-01-15 11:34:27 -08:00
mlp.py	Implement TensorParallel for FusedDense and FusedDenseGeluDense	2022-12-24 11:48:56 -08:00