| block.py | [MLP] Add ParallelMLP | 2023-07-22 23:45:51 -07:00 |
| embedding.py | Reorder LN in Block, support OPT | 2023-01-15 22:14:31 -08:00 |
| mha.py | [MHA] Implement MQA/GQA | 2023-07-23 00:06:58 -07:00 |
| mlp.py | Implement ParallelGatedMlp (#251) | 2023-07-26 12:14:15 -07:00 |