Go to file
ferdinand.mom f74bff79e0 cleaning
2024-10-30 14:58:41 +00:00
src small change 2024-10-30 14:58:41 +00:00
template cleaning 2024-10-30 14:58:41 +00:00
.gitignore tesnsor parallel, will clean later 2024-10-18 05:13:44 +00:00
convert_hf_to_picotron.py various fix (modeling, dataloader, cpu load) 2024-10-18 14:33:46 +00:00
convert_picotron_to_hf.py refactor organisation 2024-10-10 15:12:14 +00:00
create_config.py cleaning 2024-10-30 14:58:41 +00:00
generate.py various fix (modeling, dataloader, cpu load) 2024-10-18 14:33:46 +00:00
model.py add assert in TensorParallel for num_attention_heads and key_values_heads 2024-10-30 14:58:41 +00:00
README.md Initial commit 2024-09-18 14:01:22 +02:00
requirements.txt add wandb support 2024-09-25 14:19:16 +00:00
setup.py tesnsor parallel, will clean later 2024-10-18 05:13:44 +00:00
submit_slurm_jobs.py cleaning 2024-10-30 14:58:41 +00:00
train.py add wandb 2024-10-30 14:58:41 +00:00
utils.py better config creation 2024-10-30 14:58:41 +00:00

picotron