|
picotron
|
refactor tensor parallel
|
2024-12-01 03:43:04 +00:00 |
|
.gitignore
|
picotron top level folder
|
2024-11-04 15:29:26 +00:00 |
|
create_config.py
|
rename to grad_steps
|
2024-11-04 15:06:29 +00:00 |
|
extract_metrics.py
|
wip: load big model with meta device
|
2024-11-29 16:38:42 +00:00 |
|
README.md
|
Initial commit
|
2024-09-18 14:01:22 +02:00 |
|
setup.py
|
tesnsor parallel, will clean later
|
2024-10-18 05:13:44 +00:00 |
|
submit_slurm_jobs.py
|
add option for HF token
|
2024-11-04 14:39:12 +00:00 |
|
train.py
|
refactor checkpoint
|
2024-12-01 03:43:00 +00:00 |