Adaptive layer splitting for wireless large language model inference in edge computing: a model-based reinforcement learning approach
Special Issue on Artificial-Intelligence-Empowered Digital-Twin-Based Network Autonomy|Updated:2025-03-13
|
Adaptive layer splitting for wireless large language model inference in edge computing: a model-based reinforcement learning approach
Enhanced Publication
基于模型强化学习的边缘计算无线大语言模型推理自适应层切分方法
“In the field of edge computing, this study introduces a framework inspired by model-based reinforcement learning to optimize the deployment of large language models, significantly reducing computational costs and balancing inference performance and computational load under various network conditions.”
Frontiers of Information Technology & Electronic EngineeringVol. 26, Issue 2, Pages: 278-292(2025)
Affiliations:
1.College of Information Science & Electronic Engineering, Zhejiang University, Hangzhou 310027, China
Yuxuan CHEN, Rongpeng LI, Xiaoxue YU, et al. Adaptive layer splitting for wireless large language model inference in edge computing: a model-based reinforcement learning approach[J]. Frontiers of information technology & electronic engineering, 2025, 26(2): 278-292.
DOI:
Yuxuan CHEN, Rongpeng LI, Xiaoxue YU, et al. Adaptive layer splitting for wireless large language model inference in edge computing: a model-based reinforcement learning approach[J]. Frontiers of information technology & electronic engineering, 2025, 26(2): 278-292. DOI: 10.1631/FITEE.2400468.
Adaptive layer splitting for wireless large language model inference in edge computing: a model-based reinforcement learning approachEnhanced Publication