| layers | Run isort and black on test files | 2023-08-18 20:59:35 -07:00 |
| losses | [CrossEntropy] Test longer sequences | 2023-12-16 19:11:23 -08:00 |
| models | Implement norm head for Baichuan2 | 2023-12-22 16:55:40 -08:00 |
| modules | Run isort and black on test files | 2023-08-18 20:59:35 -07:00 |
| test_rotary.py | [Rotary] Implement varlen rotary | 2023-09-03 17:57:10 -07:00 |