Go to file
2024-12-13 13:29:11 +00:00
picotron stop iteration fix. recreate a new dataloder 2024-12-13 13:29:11 +00:00
template set num workers to 1 for now to avoid os memory error 2024-11-04 14:39:52 +00:00
tests stop iteration fix. recreate a new dataloder 2024-12-13 13:29:11 +00:00
.gitignore picotron top level folder 2024-11-04 15:29:26 +00:00
create_config.py rename to grad_steps 2024-11-04 15:06:29 +00:00
README.md Initial commit 2024-09-18 14:01:22 +02:00
requirements.txt fix requirements to avoid drop in throughput 2024-11-04 14:33:07 +00:00
setup.py tesnsor parallel, will clean later 2024-10-18 05:13:44 +00:00
submit_slurm_jobs.py add option for HF token 2024-11-04 14:39:12 +00:00
train.py add mfu, get number of parameters 2024-11-18 17:36:51 +00:00

picotron