| src | fix DP integration within PP (1f1b) | 2024-11-01 20:08:48 +00:00 |
| template | change distributed option to pass to multi-node | 2024-11-02 02:18:49 +00:00 |
| .gitignore | tensor parallel, will clean later | 2024-10-18 05:13:44 +00:00 |
| convert_picotron_to_hf.py | refactor organisation | 2024-10-10 15:12:14 +00:00 |
| create_config.py | add fuse adam | 2024-11-02 01:38:14 +00:00 |
| generate.py | various fixes (modeling, dataloader, cpu load) | 2024-10-18 14:33:46 +00:00 |
| README.md | Initial commit | 2024-09-18 14:01:22 +02:00 |
| setup.py | tensor parallel, will clean later | 2024-10-18 05:13:44 +00:00 |
| train.py | change distributed option to pass to multi-node | 2024-11-02 02:18:49 +00:00 |
| utils.py | change distributed option to pass to multi-node | 2024-11-02 02:18:49 +00:00 |