Actor-Critic Deep Reinforcement Learning for Energy Minimization in UAV-Aided Networks

Citations: 0
Authors
Yuan, Yaxiong [1 ]
Lei, Lei [1 ]
Vu, Thang X. [1 ]
Chatzinotas, Symeon [1 ]
Ottersten, Bjorn [1 ]
Affiliations
[1] Univ Luxembourg, Interdisciplinary Ctr Secur Reliabil & Trust SnT, Luxembourg, Luxembourg
Keywords
UAV-aided networks; deep reinforcement learning; actor-critic; user scheduling; energy minimization; optimization
DOI
10.1109/eucnc48522.2020.9200931
Chinese Library Classification
TP [automation and computer technology]
Discipline Code
0812
Abstract
In this paper, we investigate a user-timeslot scheduling problem for downlink unmanned aerial vehicle (UAV)-aided networks, where the UAV serves as an aerial base station. We formulate an optimization problem that jointly determines user scheduling and hovering time to minimize the UAV's transmission and hovering energy. An offline algorithm based on the branch-and-bound method and the golden section search is proposed to solve the problem. However, the offline algorithm suffers from exponential growth in computational time. We therefore apply a deep reinforcement learning (DRL) method to design an online algorithm with lower computational cost. To this end, we first reformulate the original user scheduling problem as a Markov decision process (MDP). Then, an actor-critic-based RL algorithm is developed to determine the scheduling policy under the guidance of two deep neural networks. Numerical results show that the proposed online algorithm achieves a good tradeoff between performance gain and computational time.
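The actor-critic scheme outlined in the abstract can be illustrated on a toy scheduling MDP. Everything below is a hedged sketch under stated assumptions: the state/action spaces, dynamics, and the reward (a stand-in "negative energy") are hypothetical placeholders, not the paper's actual energy model or network design; only the generic one-step TD actor-critic update is shown, with linear/tabular parameters in place of the two deep neural networks.

```python
import numpy as np

# Toy actor-critic for a hypothetical user-scheduling MDP:
# at each timeslot the agent picks which user to schedule; the critic
# learns a state-value estimate, the actor a softmax policy over users.
# Reward, dynamics, and dimensions are illustrative assumptions only.

rng = np.random.default_rng(0)
N_USERS = 3                 # actions: which user to schedule
N_STATES = 4                # coarse channel/queue states (hypothetical)
GAMMA = 0.9                 # discount factor
ALPHA_ACTOR, ALPHA_CRITIC = 0.05, 0.1

theta = np.zeros((N_STATES, N_USERS))   # actor parameters (softmax logits)
V = np.zeros(N_STATES)                  # critic: state-value table

def policy(s):
    """Softmax scheduling policy over users in state s."""
    logits = theta[s] - theta[s].max()  # shift for numerical stability
    p = np.exp(logits)
    return p / p.sum()

def step(s, a):
    """Hypothetical dynamics: reward is the negative 'energy' of serving
    user a in state s; the next state is drawn at random."""
    reward = -abs(s - a)
    return int(rng.integers(N_STATES)), reward

s = 0
for _ in range(2000):
    p = policy(s)
    a = rng.choice(N_USERS, p=p)
    s_next, r = step(s, a)
    # TD error drives both updates, as in the actor-critic template.
    td = r + GAMMA * V[s_next] - V[s]
    V[s] += ALPHA_CRITIC * td           # critic update
    grad = -p
    grad[a] += 1.0                      # gradient of log-softmax policy
    theta[s] += ALPHA_ACTOR * td * grad # actor update
    s = s_next

print(policy(0))  # learned scheduling distribution in state 0
```

With this toy reward, serving user `a == s` costs the least, so after training the policy in each state concentrates on the matching user; in the real problem the critic and actor would instead be the two deep networks mentioned in the abstract.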
Pages: 348-352
Page count: 5