Dynamic Routing for Integrated Satellite-Terrestrial Networks: A Constrained Multi-Agent Reinforcement Learning Approach

被引:1
|
作者
Lyu, Yifeng [1 ]
Hu, Han [1 ]
Fan, Rongfei [2 ]
Liu, Zhi [3 ]
An, Jianping [2 ]
Mao, Shiwen [4 ]
机构
[1] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Sch Cyberspace Sci & Technol, Beijing 100081, Peoples R China
[3] Univ Electrocommun, Grad Sch Informat & Engn, Tokyo 1828585, Japan
[4] Auburn Univ, Dept Elect & Comp Engn, 5201 USA, Auburn, AL 36849 USA
基金
中国国家自然科学基金;
关键词
Integrated satellite-terrestrial networks; dynamic routing algorithm; end-to-end delay; constrained multi-agent reinforcement learning; CONSTELLATION; INTERNET;
D O I
10.1109/JSAC.2024.3365869
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The integrated satellite-terrestrial network (ISTN) system has experienced significant growth, offering seamless communication services in remote areas with limited terrestrial infrastructure. However, designing a routing scheme for ISTN is exceedingly difficult, primarily due to the heightened complexity resulting from the inclusion of additional ground stations, along with the requirement to satisfy various constraints related to satellite service quality. To address these challenges, we study packet routing with ground stations and satellites working jointly to transmit packets, while prioritizing fast communication and meeting energy efficiency and packet loss requirements. Specifically, we formulate the problem of packet routing with constraints as a max-min problem using the Lagrange method. Then we propose a novel constrained Multi-Agent reinforcement learning (MARL) dynamic routing algorithm named CMADR, which efficiently balances objective improvement and constraint satisfaction during the updating of policy and Lagrange multipliers. Finally, we conduct extensive experiments and an ablation study using the OneWeb and Telesat mega-constellations. Results demonstrate that CMADR reduces the packet delay by a minimum of 21% and 15%, while meeting stringent energy consumption and packet loss rate constraints, outperforming several baseline algorithms.
引用
收藏
页码:1204 / 1218
页数:15
相关论文
共 50 条
  • [21] Deep Reinforcement Learning-Based Joint Satellite Scheduling and Resource Allocation in Satellite-Terrestrial Integrated Networks
    Yin, Yabo
    Huang, Chuanhe
    Wu, Dong-Fang
    Huang, Shidong
    Ashraf, M. Wasim Abbas
    Guo, Qianqian
    Zhang, Lin
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [22] Learning to Routing in UAV Swarm Network: A Multi-Agent Reinforcement Learning Approach
    Wang, Zunliang
    Yao, Haipeng
    Mai, Tianle
    Xiong, Zehui
    Wu, Xiaohua
    Wu, Di
    Guo, Song
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (05) : 6611 - 6624
  • [23] A multi-agent reinforcement learning approach to dynamic service composition
    Wang, Hongbing
    Wang, Xiaojun
    Hu, Xingguo
    Zhang, Xingzhi
    Gu, Mingzhu
    INFORMATION SCIENCES, 2016, 363 : 96 - 119
  • [24] DeepCQ plus : Robust and Scalable Routing with Multi-Agent Deep Reinforcement Learning for Highly Dynamic Networks
    Kaviani, Saeed
    Ryu, Bo
    Ahmed, Ejaz
    Larson, Kevin
    Le, Anh
    Yahja, Alex
    Kim, Jae H.
    2021 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM 2021), 2021,
  • [25] Integrated management of large satellite-terrestrial networks
    Baras, JS
    Ball, M
    Roussopoulos, N
    Jang, K
    Stathatos, K
    Valluri, J
    MILCOM 97 PROCEEDINGS, VOLS 1-3, 1997, : 383 - 387
  • [26] Dynamic User Association for Resilient Backhauling in Satellite-Terrestrial Integrated Networks
    Dai, Cui-Qin
    Luo, Junfeng
    Fu, Shu
    Wu, Jinsong
    Chen, Qianbin
    IEEE SYSTEMS JOURNAL, 2020, 14 (04): : 5025 - 5036
  • [27] Multi-objective Intelligent Handover in Satellite-Terrestrial Integrated Networks
    Dai, Cui-Qin
    Xu, Jing
    Wut, Jinsong
    Chen, Qianbin
    2022 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2022, : 367 - 372
  • [28] A Multi-agent Reinforcement Learning based Routing Protocol for Wireless Sensor Networks
    Liang, Xuedong
    Balasingham, Ilangko
    Byun, Sang-Seon
    2008 IEEE INTERNATIONAL SYMPOSIUM ON WIRELESS COMMUNICATION SYSTEMS (ISWCS 2008), 2008, : 528 - +
  • [29] Federated split learning for sequential data in satellite-terrestrial integrated networks
    Jiang, Weiwei
    Han, Haoyu
    Zhang, Yang
    Mu, Jianbin
    INFORMATION FUSION, 2024, 103
  • [30] A Multi-Agent Reinforcement Learning Routing Protocol for Underwater Optical Sensor Networks
    Li, Xinge
    Hu, Xiaoya
    Li, Wei
    Hu, Hui
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,