Dynamic Routing for Integrated Satellite-Terrestrial Networks: A Constrained Multi-Agent Reinforcement Learning Approach

Cited by: 1
Authors
Lyu, Yifeng [1]
Hu, Han [1]
Fan, Rongfei [2]
Liu, Zhi [3]
An, Jianping [2]
Mao, Shiwen [4]
Affiliations
[1] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Sch Cyberspace Sci & Technol, Beijing 100081, Peoples R China
[3] Univ Electrocommun, Grad Sch Informat & Engn, Tokyo 1828585, Japan
[4] Auburn Univ, Dept Elect & Comp Engn, 5201 USA, Auburn, AL 36849 USA
Funding
National Natural Science Foundation of China
Keywords
Integrated satellite-terrestrial networks; dynamic routing algorithm; end-to-end delay; constrained multi-agent reinforcement learning; CONSTELLATION; INTERNET;
DOI
10.1109/JSAC.2024.3365869
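Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Subject classification codes
0808; 0809
Abstract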
The integrated satellite-terrestrial network (ISTN) system has experienced significant growth, offering seamless communication services in remote areas with limited terrestrial infrastructure. However, designing a routing scheme for ISTN is exceedingly difficult, primarily due to the heightened complexity resulting from the inclusion of additional ground stations, along with the requirement to satisfy various constraints related to satellite service quality. To address these challenges, we study packet routing with ground stations and satellites working jointly to transmit packets, while prioritizing fast communication and meeting energy efficiency and packet loss requirements. Specifically, we formulate the problem of packet routing with constraints as a max-min problem using the Lagrange method. We then propose a novel constrained multi-agent reinforcement learning (MARL) dynamic routing algorithm named CMADR, which efficiently balances objective improvement and constraint satisfaction during the updating of policy and Lagrange multipliers. Finally, we conduct extensive experiments and an ablation study using the OneWeb and Telesat mega-constellations. Results demonstrate that CMADR reduces the packet delay by a minimum of 21% and 15%, while meeting stringent energy consumption and packet loss rate constraints, outperforming several baseline algorithms.
Pages: 1204 - 1218
Page count: 15
Related papers
50 in total
  • [31] Joint Network Function Placement and Routing Optimization in Dynamic Software-Defined Satellite-Terrestrial Integrated Networks
    Yuan, Shuo
    Sun, Yaohua
    Peng, Mugen
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (05) : 5172 - 5186
  • [32] A reinforcement learning approach for developing routing policies in multi-agent production scheduling
    Yi-Chi Wang
    John M. Usher
    The International Journal of Advanced Manufacturing Technology, 2007, 33 : 323 - 333
  • [33] A reinforcement learning approach for developing routing policies in multi-agent production scheduling
    Wang, Yi-Chi
    Usher, John M.
    International Journal of Advanced Manufacturing Technology, 2007, 33 (3-4) : 323 - 333
  • [34] A reinforcement learning approach for developing routing policies in multi-agent production scheduling
    Wang, Yi-Chi
    Usher, John M.
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2007, 33 (3-4) : 323 - 333
  • [35] Fair collaborative vehicle routing: A deep multi-agent reinforcement learning approach
    Mak, Stephen
    Xu, Liming
    Pearce, Tim
    Ostroumov, Michael
    Brintrup, Alexandra
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2023, 157
  • [36] Distributed reinforcement learning in multi-agent networks
    Kar, Soummya
    Moura, Jose M. F.
    Poor, H. Vincent
    2013 IEEE 5TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2013), 2013, : 296 - +
  • [37] Collaborative Deep Reinforcement Learning in 6G Integrated Satellite-Terrestrial Networks: Paradigm, Solutions, and Trends
    Yang, Yang
    He, Xinyu
    Lee, Jemin
    He, Dazhong
    Lu, Yonghui
    IEEE COMMUNICATIONS MAGAZINE, 2024
  • [38] Multi-agent systems on sensor networks: A distributed reinforcement learning approach
    Tham, CK
    Renaud, JC
    PROCEEDINGS OF THE 2005 INTELLIGENT SENSORS, SENSOR NETWORKS & INFORMATION PROCESSING CONFERENCE, 2005, : 423 - 429
  • [39] Multi-agent reinforcement learning for cooperative trajectory design of UAV-BS fleets in terrestrial/non-terrestrial integrated networks
    Hoang, Linh T.
    Nguyen, Chuyen T.
    Le, Hoang D.
    Pham, Anh T.
    IEICE COMMUNICATIONS EXPRESS, 2024, 13 (08): : 327 - 330
  • [40] Integrated Satellite-Terrestrial Routing Using Distributionally Robust Optimization
    Tsai, Kai-Chu
    Fan, Lei
    Lent, Ricardo
    Wang, Li-Chun
    Han, Zhu
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 277 - 282