Commit Graph

14 Commits

Author SHA1 Message Date
ferdinand.mom
7daefd31ee Merge branch 'main' into loading_big_model 2024-12-18 17:02:48 +00:00
ferdinand.mom
b647f58289 fix stuff to make it CPU compliants 2024-12-18 16:50:36 +00:00
ferdinand.mom
b57b8277d1 breaking: add new version of initi meta device but memory leaks 2024-12-17 15:46:16 +00:00
ferdinand.mom
859650a2c0 breaking: refactor loading big model to only download safetensors files 2024-12-17 15:46:09 +00:00
ferdinand.mom
b0ea5066ad small changes 2024-12-17 05:01:35 +00:00
ferdinand.mom
00ddbd9d2e raise Exception when not enough layers to distributed in rank + rename variable 2024-12-03 13:17:52 +00:00
ferdinand.mom
b80091e8ec set 1f1b by default 2024-12-03 10:10:12 +00:00
ferdinand.mom
86a0fc5e3d avois OS memory error with num_workers > 1 2024-12-02 18:33:35 +00:00
ferdinand.mom
32d8daa880 can now load big model through safetensors (sharded and single file) 2024-12-01 19:39:16 +00:00
ferdinand.mom
a44f905254 set num workers to 1 for now to avoid os memory error 2024-11-04 14:39:52 +00:00
ferdinand.mom
e19f74b715 add option for HF token 2024-11-04 14:39:12 +00:00
ferdinand.mom
7bfdf5f7d1 add fuse adam 2024-11-04 14:35:36 +00:00
ferdinand.mom
519b506b2b add option to switch between pp engine 2024-11-04 14:32:44 +00:00
ferdinand.mom
f74bff79e0 cleaning 2024-10-30 14:58:41 +00:00