This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
picotron
Watch
1
Star
0
Fork
0
You've already forked picotron
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
72
Commits
1
Branch
0
Tags
3.1
MiB
Python
98%
Shell
2%
e7b4722160
Go to file
HTTPS
Download ZIP
Download TAR.GZ
Download BUNDLE
Open with VS Code
Open with VSCodium
Open with Intellij IDEA
Cite this repository
APA
BibTeX
Cancel
ferdinand.mom
e7b4722160
remove unecessary files
2024-11-04 15:27:53 +00:00
src
some dp renaming
2024-11-04 14:48:12 +00:00
template
set num workers to 1 for now to avoid os memory error
2024-11-04 14:39:52 +00:00
.gitignore
tesnsor parallel, will clean later
2024-10-18 05:13:44 +00:00
create_config.py
rename to grad_steps
2024-11-04 15:06:29 +00:00
model.py
fix spliting input twice for context parallel (done in dataloader)
2024-10-30 15:43:42 +00:00
README.md
Initial commit
2024-09-18 14:01:22 +02:00
requirements.txt
fix requirements to avoid drop in throughput
2024-11-04 14:33:07 +00:00
setup.py
tesnsor parallel, will clean later
2024-10-18 05:13:44 +00:00
submit_slurm_jobs.py
add option for HF token
2024-11-04 14:39:12 +00:00
train.py
rename to grad_steps
2024-11-04 15:06:29 +00:00
utils.py
rename to grad_steps
2024-11-04 15:06:29 +00:00
README.md
picotron