ferdinand.mom
|
7daefd31ee
|
Merge branch 'main' into loading_big_model
|
2024-12-18 17:02:48 +00:00 |
|
ferdinand.mom
|
b647f58289
|
fix stuff to make it CPU compliant
|
2024-12-18 16:50:36 +00:00 |
|
ferdinand.mom
|
b57b8277d1
|
breaking: add new version of init meta device, but it leaks memory
|
2024-12-17 15:46:16 +00:00 |
|
ferdinand.mom
|
859650a2c0
|
breaking: refactor loading big model to only download safetensors files
|
2024-12-17 15:46:09 +00:00 |
|
ferdinand.mom
|
b0ea5066ad
|
small changes
|
2024-12-17 05:01:35 +00:00 |
|
ferdinand.mom
|
00ddbd9d2e
|
raise Exception when there are not enough layers to distribute in rank + rename variable
|
2024-12-03 13:17:52 +00:00 |
|
ferdinand.mom
|
b80091e8ec
|
set 1f1b by default
|
2024-12-03 10:10:12 +00:00 |
|
ferdinand.mom
|
86a0fc5e3d
|
avoid OS memory error with num_workers > 1
|
2024-12-02 18:33:35 +00:00 |
|
ferdinand.mom
|
32d8daa880
|
can now load big model through safetensors (sharded and single file)
|
2024-12-01 19:39:16 +00:00 |
|
ferdinand.mom
|
a44f905254
|
set num workers to 1 for now to avoid os memory error
|
2024-11-04 14:39:52 +00:00 |
|
ferdinand.mom
|
e19f74b715
|
add option for HF token
|
2024-11-04 14:39:12 +00:00 |
|
ferdinand.mom
|
7bfdf5f7d1
|
add fused adam
|
2024-11-04 14:35:36 +00:00 |
|
ferdinand.mom
|
519b506b2b
|
add option to switch between pp engine
|
2024-11-04 14:32:44 +00:00 |
|
ferdinand.mom
|
f74bff79e0
|
cleaning
|
2024-10-30 14:58:41 +00:00 |
|