Automatic parallelism strategy generation with minimal memory redundancy
Regular Papers|Updated:2025-03-13
|
Automatic parallelism strategy generation with minimal memory redundancy
Enhanced Publication
最小化内存冗余的自动并行策略生成方法
“In the field of large-scale deep learning, a novel algorithm has been proposed to generate optimal parallelism strategies with minimal memory redundancy. Expert researchers have formulated the parallelism strategy search problem into an integer linear programming problem and used an efficient solver to find minimal-memory intra-operator parallelism strategies. This approach achieves memory savings of up to 67% compared to the latest Megatron-LM strategies.”
Frontiers of Information Technology & Electronic EngineeringVol. 26, Issue 1, Pages: 109-118(2025)
Affiliations:
National Key Laboratory of Parallel and Distributed Computing, National University of Defense Technology, Changsha 410000, China