This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
picotron
Watch
1
Star
0
Fork
0
You've already forked picotron
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
abd1edf9f9
picotron
/
src
/
parallel
History
ferdinand.mom
abd1edf9f9
all_reduce loss across pp/dp ranks + base_parallel
2024-10-18 15:51:17 +00:00
..
data_parallel
use global pgm for ddp
2024-10-18 14:59:26 +00:00
tensor_parallel
remove merged qkv
2024-10-18 14:59:04 +00:00
context_parallel.py
all_reduce loss across pp/dp ranks + base_parallel
2024-10-18 15:51:17 +00:00
pipeline_parallel.py
all_reduce loss across pp/dp ranks + base_parallel
2024-10-18 15:51:17 +00:00