* uneql rank. * trim. * enable passing in number of heads for each rank. * simplify. * simplify. * cleanup. * fix col parallel. * fix bug with row parallel. * fit out proj. * refac. * fix sharding logic. * refac sharding. * refac. * support multiple of. * make fn reuseable. * fix bug in dimensions. * scaffold. * test uneven heads. * fix test by adding barrier. * refac. * reuse code. * clean up. |
||
|---|---|---|
| .. | ||
| test_bert.py | ||
| test_falcon.py | ||
| test_gpt_generation_cg.py | ||
| test_gpt_generation_parallel.py | ||
| test_gpt_generation.py | ||
| test_gpt_neox.py | ||
| test_gpt_parallel.py | ||
| test_gpt.py | ||
| test_gptj.py | ||
| test_llama.py | ||
| test_opt.py | ||
| test_vit.py | ||