This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
picotron
Watch
1
Star
0
Fork
0
You've already forked picotron
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
82
Commits
1
Branch
0
Tags
3.1
MiB
Python
98%
Shell
2%
ccf2a0a4ac
Go to file
HTTPS
Download ZIP
Download TAR.GZ
Download BUNDLE
Open with VS Code
Open with VSCodium
Open with Intellij IDEA
Cite this repository
APA
BibTeX
Cancel
Ferdinand Mom
ccf2a0a4ac
Merge pull request
#7
from huggingface/refactoring
...
Refactoring
2024-11-04 19:39:34 +01:00
picotron
better api when applying parallelism to train
2024-11-04 16:52:08 +00:00
template
set num workers to 1 for now to avoid os memory error
2024-11-04 14:39:52 +00:00
.gitignore
picotron top level folder
2024-11-04 15:29:26 +00:00
create_config.py
rename to grad_steps
2024-11-04 15:06:29 +00:00
README.md
Initial commit
2024-09-18 14:01:22 +02:00
requirements.txt
fix requirements to avoid drop in throughput
2024-11-04 14:33:07 +00:00
setup.py
tesnsor parallel, will clean later
2024-10-18 05:13:44 +00:00
submit_slurm_jobs.py
add option for HF token
2024-11-04 14:39:12 +00:00
train.py
some fix
2024-11-04 16:57:00 +00:00
README.md
picotron