This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
picotron
Watch
1
Star
0
Fork
0
You've already forked picotron
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
98
Commits
1
Branch
0
Tags
3.1
MiB
Python
98%
Shell
2%
a2ce795837
Go to file
HTTPS
Download ZIP
Download TAR.GZ
Download BUNDLE
Open with VS Code
Open with VSCodium
Open with Intellij IDEA
Cite this repository
APA
BibTeX
Cancel
Ferdinand Mom
a2ce795837
Merge pull request
#8
from huggingface/add_mfu
...
add mfu, get number of parameters
2024-11-18 13:45:32 -04:00
picotron
add mfu, get number of parameters
2024-11-18 17:36:51 +00:00
template
set num workers to 1 for now to avoid os memory error
2024-11-04 14:39:52 +00:00
.gitignore
picotron top level folder
2024-11-04 15:29:26 +00:00
create_config.py
rename to grad_steps
2024-11-04 15:06:29 +00:00
README.md
Initial commit
2024-09-18 14:01:22 +02:00
requirements.txt
fix requirements to avoid drop in throughput
2024-11-04 14:33:07 +00:00
setup.py
tesnsor parallel, will clean later
2024-10-18 05:13:44 +00:00
submit_slurm_jobs.py
add option for HF token
2024-11-04 14:39:12 +00:00
train.py
add mfu, get number of parameters
2024-11-18 17:36:51 +00:00
README.md
picotron