|
bert.py
|
Reorder LN in Block, support OPT
|
2023-01-15 22:14:31 -08:00 |
|
gpt.py
|
Reorder LN in Block, support OPT
|
2023-01-15 22:14:31 -08:00 |
|
opt.py
|
Reorder LN in Block, support OPT
|
2023-01-15 22:14:31 -08:00 |
|
vit.py
|
[ViT] Use dropout_add_ln for the 1st layer norm
|
2022-11-23 12:48:56 -08:00 |