Satellite Communication Resource Scheduling Using a Dynamic Weight-Based Soft Actor Critic Reinforcement Learning

被引:0
|
作者
Qiao, Zhimin [1 ]
Yang, Weibo [2 ]
Li, Feng [1 ]
Li, Yongwei [1 ]
Zhang, Ye [1 ]
机构
[1] Taiyuan Inst Technol, Dept Automat, Taiyuan 030008, Peoples R China
[2] Changan Univ, Sch Automobile, Xian 710064, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Satellites; Heuristic algorithms; Dynamic scheduling; Task analysis; Resource management; Convergence; Optimal scheduling; Reinforcement learning; satellite resource scheduling; dynamic weight; soft actor critic; ALLOCATION;
D O I
10.1109/ACCESS.2024.3438930
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the key challenge faced by space-based network is how to maximize the demand for on-board resources for ground communication tasks, given the limited availability of satellite resources. For this challenge, firstly, we propose a joint state space of satellite task requirements and resource pools to obtain the global information of the environment, avoiding convergence to local optimal strategies. Secondly, we propose a new joint partitioning method for frequency and time resources, which avoids the fragmentation of the resource to the maximum extent. Thirdly, a new algorithm called dynamic weight based soft actor critic (DWSAC) is proposed, which enhances the update range when the actions taken by the agent significantly contribute to the improvement of system performance, otherwise weakens the update range, significantly improving the convergence efficiency and performance of the soft actor critic (SAC). The results show that the proposed model and algorithm have good practicability, which can make the average resource occupancy rate higher and the running cost lower.
引用
下载
收藏
页码:111653 / 111662
页数:10
相关论文
共 50 条
  • [31] Bayesian Soft Actor-Critic: A Directed Acyclic Strategy Graph Based Deep Reinforcement Learning
    Yang, Qin
    Parasuraman, Ramviyas
    39TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2024, 2024, : 646 - 648
  • [32] Actor-Critic Deep Reinforcement Learning for Solving Job Shop Scheduling Problems
    Liu, Chien-Liang
    Chang, Chuan-Chin
    Tseng, Chun-Jan
    IEEE ACCESS, 2020, 8 : 71752 - 71762
  • [33] Mutli-agent consensus under communication failure using Actor-Critic Reinforcement Learning
    Kandath, Harikumar
    Senthilnath, J.
    Sundaram, Suresh
    2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2018, : 1461 - 1465
  • [34] Dynamic Spectrum Sharing Based on Federated Learning and Multi-Agent Actor-Critic Reinforcement Learning
    Yang, Tongtong
    Zhang, Wensheng
    Bo, Yulian
    Sun, Jian
    Wang, Cheng-Xiang
    2023 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2023, : 947 - 952
  • [35] A Deep Actor-Critic Reinforcement Learning Framework for Dynamic Multichannel Access
    Zhong, Chen
    Lu, Ziyang
    Gursoy, M. Cenk
    Velipasalar, Senem
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2019, 5 (04) : 1125 - 1139
  • [36] CONTROLLED SENSING AND ANOMALY DETECTION VIA SOFT ACTOR-CRITIC REINFORCEMENT LEARNING
    Zhong, Chen
    Gursoy, M. Cenk
    Velipasalar, Senem
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4198 - 4202
  • [37] SAC-FACT: Soft Actor-Critic Reinforcement Learning for Counterfactual Explanations
    Ezzeddine, Fatima
    Ayoub, Omran
    Andreoletti, Davide
    Giordano, Silvia
    EXPLAINABLE ARTIFICIAL INTELLIGENCE, XAI 2023, PT I, 2023, 1901 : 195 - 216
  • [38] SOFT ACTOR-CRITIC REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATOR WITH HINDSIGHT EXPERIENCE REPLAY
    Yan, Tao
    Zhang, Wenan
    Yang, Simon X.
    Yu, Li
    INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2019, 34 (05): : 536 - 543
  • [39] Reinforcement learning for automatic quadrilateral mesh generation: A soft actor-critic approach
    Pan, Jie
    Huang, Jingwei
    Cheng, Gengdong
    Zeng, Yong
    NEURAL NETWORKS, 2023, 157 : 288 - 304
  • [40] Coverage Path Planning Using Actor–Critic Deep Reinforcement Learning
    Garrido-Castañeda, Sergio Isahí
    Vasquez, Juan Irving
    Antonio-Cruz, Mayra
    Sensors, 2025, 25 (05)