|
picotron
|
better api when applying parallelism to train
|
2024-11-04 16:52:08 +00:00 |
|
.gitignore
|
picotron top level folder
|
2024-11-04 15:29:26 +00:00 |
|
create_config.py
|
rename to grad_steps
|
2024-11-04 15:06:29 +00:00 |
|
README.md
|
Initial commit
|
2024-09-18 14:01:22 +02:00 |
|
setup.py
|
tesnsor parallel, will clean later
|
2024-10-18 05:13:44 +00:00 |
|
submit_slurm_jobs.py
|
add option for HF token
|
2024-11-04 14:39:12 +00:00 |
|
train.py
|
some cleaning in train
|
2024-11-04 16:54:49 +00:00 |