flash-attention/flash_attn/modules
2023-01-15 22:58:56 -08:00
..
__init__.py Add __init__.py files to subdirectories for installation 2022-11-17 16:55:44 -08:00
block.py [ViT] Fix extra norm_0, use new LN order in Block 2023-01-15 22:58:56 -08:00
embedding.py Reorder LN in Block, support OPT 2023-01-15 22:14:31 -08:00
mha.py [Gen] Make generation work with Tensor Parallel 2023-01-15 11:34:27 -08:00
mlp.py Implement TensorParallel for FusedDense and FusedDenseGeluDense 2022-12-24 11:48:56 -08:00