Satellite Communication Resource Scheduling Using a Dynamic Weight-Based Soft Actor Critic Reinforcement Learning

被引:0
|
作者
Qiao, Zhimin [1 ]
Yang, Weibo [2 ]
Li, Feng [1 ]
Li, Yongwei [1 ]
Zhang, Ye [1 ]
机构
[1] Taiyuan Inst Technol, Dept Automat, Taiyuan 030008, Peoples R China
[2] Changan Univ, Sch Automobile, Xian 710064, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Satellites; Heuristic algorithms; Dynamic scheduling; Task analysis; Resource management; Convergence; Optimal scheduling; Reinforcement learning; satellite resource scheduling; dynamic weight; soft actor critic; ALLOCATION;
D O I
10.1109/ACCESS.2024.3438930
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the key challenge faced by space-based network is how to maximize the demand for on-board resources for ground communication tasks, given the limited availability of satellite resources. For this challenge, firstly, we propose a joint state space of satellite task requirements and resource pools to obtain the global information of the environment, avoiding convergence to local optimal strategies. Secondly, we propose a new joint partitioning method for frequency and time resources, which avoids the fragmentation of the resource to the maximum extent. Thirdly, a new algorithm called dynamic weight based soft actor critic (DWSAC) is proposed, which enhances the update range when the actions taken by the agent significantly contribute to the improvement of system performance, otherwise weakens the update range, significantly improving the convergence efficiency and performance of the soft actor critic (SAC). The results show that the proposed model and algorithm have good practicability, which can make the average resource occupancy rate higher and the running cost lower.
引用
下载
收藏
页码:111653 / 111662
页数:10
相关论文
共 50 条
  • [21] ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR DYNAMIC MULTICHANNEL ACCESS
    Zhong, Chen
    Lu, Ziyang
    Gursoy, M. Cenk
    Velipasalar, Senem
    2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 599 - 603
  • [22] Dynamic Charging Scheme Problem With Actor-Critic Reinforcement Learning
    Yang, Meiyi
    Liu, Nianbo
    Zuo, Lin
    Feng, Yong
    Liu, Minghui
    Gong, Haigang
    Liu, Ming
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (01) : 370 - 380
  • [23] Actor-Critic reinforcement learning based on prior knowledge
    Yang, Zhenyu, 1600, Transport and Telecommunication Institute, Lomonosova street 1, Riga, LV-1019, Latvia (18):
  • [24] Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic
    Ren, Yangang
    Duan, Jingliang
    Li, Shengbo Eben
    Guan, Yang
    Sun, Qi
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [25] A soft actor-critic reinforcement learning algorithm for network intrusion detection
    Li, Zhengfa
    Huang, Chuanhe
    Deng, Shuhua
    Qiu, Wanyu
    Gao, Xieping
    COMPUTERS & SECURITY, 2023, 135
  • [26] A Novel Actor-Critic Motor Reinforcement Learning for Continuum Soft Robots
    Pantoja-Garcia, Luis
    Parra-Vega, Vicente
    Garcia-Rodriguez, Rodolfo
    Vazquez-Garcia, Carlos Ernesto
    ROBOTICS, 2023, 12 (05)
  • [27] Dynamic power management of an embedded sensor network based on actor-critic reinforcement based learning
    Sridhar, Prasanna
    Nanayakkara, Thrishantha
    Madni, Asad M.
    Jamshidi, Mo
    2007 THIRD INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION FOR SUSTAINABILITY, 2007, : 71 - +
  • [28] An actor-critic reinforcement learning-based resource management in mobile edge computing systems
    Fu, Fang
    Zhang, Zhicai
    Yu, Fei Richard
    Yan, Qiao
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (08) : 1875 - 1889
  • [29] Adaptive and Efficient Resource Allocation in Cloud Datacenters Using Actor-Critic Deep Reinforcement Learning
    Chen, Zheyi
    Hu, Jia
    Min, Geyong
    Luo, Chunbo
    El-Ghazawi, Tarek
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (08) : 1911 - 1923
  • [30] An actor-critic reinforcement learning-based resource management in mobile edge computing systems
    Fang Fu
    Zhicai Zhang
    Fei Richard Yu
    Qiao Yan
    International Journal of Machine Learning and Cybernetics, 2020, 11 : 1875 - 1889