Actor-Critic Deep Reinforcement Learning for Energy Minimization in UAV-Aided Networks

Cited by: 0

Authors:

Yuan, Yaxiong [1]

Lei, Lei [1]

Vu, Thang X. [1]

Chatzinotas, Symeon [1]

Ottersten, Bjorn [1]

Affiliations:

[1] Univ Luxembourg, Interdisciplinary Ctr Secur Reliabil & Trust SnT, Luxembourg, Luxembourg

Source:

2020 EUROPEAN CONFERENCE ON NETWORKS AND COMMUNICATIONS (EUCNC 2020) | 2020

Keywords:

UAV-aided networks; deep reinforcement learning; actor-critic; user scheduling; energy minimization; optimization

DOI:

10.1109/eucnc48522.2020.9200931

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

In this paper, we investigate a user-timeslot scheduling problem for downlink unmanned aerial vehicle (UAV)-aided networks, where the UAV serves as an aerial base station. We formulate an optimization problem that jointly determines user scheduling and hovering time to minimize the UAV's transmission and hovering energy. An offline algorithm based on the branch and bound method and the golden section search is proposed to solve the problem. However, the computational time of the offline algorithm grows exponentially. Therefore, we apply a deep reinforcement learning (DRL) method to design an online algorithm with less computational time. To this end, we first reformulate the original user scheduling problem as a Markov decision process (MDP). Then, an actor-critic-based RL algorithm is developed to determine the scheduling policy under the guidance of two deep neural networks. Numerical results show that the proposed online algorithm obtains a good tradeoff between performance gain and computational time.
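
The abstract names the golden section search as one building block of the offline algorithm. The following is a minimal sketch of that generic technique only, not the authors' implementation. The function `toy_energy` and all of its constants are hypothetical placeholders chosen so that the energy is unimodal in the hovering time; the paper's actual energy model is not given in this record.

```python
import math

def golden_section_search(f, lo, hi, tol=1e-6):
    """Return an approximate minimizer of a unimodal function f on [lo, hi]."""
    inv_phi = (math.sqrt(5) - 1) / 2  # 1 / golden ratio, about 0.618
    a, b = lo, hi
    c = b - inv_phi * (b - a)  # left interior point
    d = a + inv_phi * (b - a)  # right interior point
    fc, fd = f(c), f(d)
    while b - a > tol:
        if fc < fd:
            # Minimum lies in [a, d]; the old c becomes the new d.
            b, d, fd = d, c, fc
            c = b - inv_phi * (b - a)
            fc = f(c)
        else:
            # Minimum lies in [c, b]; the old d becomes the new c.
            a, c, fc = c, d, fd
            d = a + inv_phi * (b - a)
            fd = f(d)
    return (a + b) / 2

def toy_energy(t, demand_bits=2e6, bandwidth_hz=1e6,
               noise_over_gain=0.05, hover_power_w=2.0):
    """Hypothetical energy model (an assumption, not taken from the paper).

    Hovering energy grows linearly in t, while the transmit power needed to
    deliver demand_bits within t seconds (Shannon-rate form) shrinks as t grows.
    """
    tx_power = (2 ** (demand_bits / (bandwidth_hz * t)) - 1) * noise_over_gain
    return t * (hover_power_w + tx_power)

if __name__ == "__main__":
    t_star = golden_section_search(toy_energy, 0.1, 20.0)
    print(f"hovering time minimizing the toy energy: {t_star:.4f} s")
```

Each iteration reuses one of the two interior function values, so the search needs only one new evaluation of `f` per step; this holds only when `f` is unimodal on the interval.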


Pages: 348-352

Page count: 5

