|
layers
|
[Rotary] Don't store inv_freq in state_dict
|
2023-07-22 23:52:42 -07:00 |
|
models
|
[Rotary] Don't store inv_freq in state_dict
|
2023-07-22 23:52:42 -07:00 |
|
modules
|
[MLP] Add ParallelMLP
|
2023-07-22 23:45:51 -07:00 |
|
utils
|
[Gen] Minor tweak to allocate_inference_cache
|
2023-04-21 11:56:47 -07:00 |
|
__init__.py
|
FlashAttention-2 release
|
2023-07-17 06:21:34 -07:00 |
|
bert_padding.py
|
remove numpy dependency
|
2022-10-06 19:17:15 +02:00 |
|
flash_attn_interface.py
|
Make sure dout is contiguous
|
2023-07-17 21:54:44 -07:00 |
|
flash_blocksparse_attention.py
|
Rename src -> flash_attn
|
2022-06-01 18:50:26 -07:00 |