picotron/picotron
2024-11-20 01:58:44 +00:00
..
context_parallel better api when applying parallelism to train 2024-11-04 16:52:08 +00:00
data_parallel remove redundancy 2024-11-20 01:58:44 +00:00
pipeline_parallel Merge branch 'main' into add-grad-acc-pp 2024-11-04 18:42:40 +00:00
tensor_parallel better api when applying parallelism to train 2024-11-04 16:52:08 +00:00
__init__.py picotron top level folder 2024-11-04 15:29:26 +00:00
data.py separate dataloader from utils to data.py 2024-11-04 15:36:01 +00:00
model.py remove redundancy 2024-11-20 01:58:44 +00:00
process_group_manager.py picotron top level folder 2024-11-04 15:29:26 +00:00
utils.py mfu ref/typo 2024-11-18 17:57:02 +00:00