Your Location:
Home >
Browse articles >
Automatic parallelism strategy generation with minimal memory redundancy
Regular Papers | Updated:2025-03-13
    • Automatic parallelism strategy generation with minimal memory redundancy

      Enhanced Publication
    • 最小化内存冗余的自动并行策略生成方法
    • In the field of large-scale deep learning, a novel algorithm has been proposed to generate optimal parallelism strategies with minimal memory redundancy. Expert researchers have formulated the parallelism strategy search problem into an integer linear programming problem and used an efficient solver to find minimal-memory intra-operator parallelism strategies. This approach achieves memory savings of up to 67% compared to the latest Megatron-LM strategies.
    • Frontiers of Information Technology & Electronic Engineering   Vol. 26, Issue 1, Pages: 109-118(2025)
    • DOI:10.1631/FITEE.2300684    

      CLC: TP181
    • Received:10 October 2023

      Revised:2023-10-17

      Published Online:27 December 2024

      Published:2025-01

    Scan QR Code

  • Yanqi SHI, Peng LIANG, Hao ZHENG, et al. Automatic parallelism strategy generation with minimal memory redundancy[J]. Frontiers of Information Technology & Electronic Engineering, 2025, 26(1): 109-118. DOI: 10.1631/FITEE.2300684.

  •  
  •  

0

Views

244

Downloads

0

CSCD

>
Alert me when the article has been cited
Submit
Tools
Download
Export Citation
Share
Add to favorites
Add to my album

Related Articles

MltAuxTSPP: a unified benchmark for deep learning-based traffic state prediction with multi-source auxiliary data
Q-space-coordinate-guided neural networks for high-fidelity diffusion tensor estimation from minimal diffusion-weighted images
CRGT-SA: an interlaced and spatiotemporal deep learning model for network intrusion detection
Federated model with contrastive learning and adaptive control variates for human activity recognition
A comprehensive survey of physical adversarial vulnerabilities in autonomous driving systems

Related Author

Yusong ZHOU
Xiaoyu JIANG
Shu SUN
Xinmin ZHANG
Yuanqiu MO
Zhihuan SONG
Maokun ZHENG
Zhi LI

Related Institution

College of Control Science and Engineering, Zhejiang University
State Key Laboratory of Industrial Control Technology, Zhejiang University
School of Mathematics, Southeast University
State Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University
School of Electronic and Electrical Engineering, Shanghai University of Engineering Science
0