Reinforcement Learning-Based Age of Information Optimization in UAV-Enabled Communication System

被引:0
|
作者
Li X. [1 ]
Yin B. [1 ]
Wei L. [1 ]
Zhang X. [1 ]
机构
[1] School of Information Engineering, Southwest University of Science and Technology, Mianyang
关键词
Age of information; Reinforcement learning; Trajectory optimization; Unmanned aerial vehicles; Wireless telecommunication system;
D O I
10.12178/1001-0548.2021128
中图分类号
学科分类号
摘要
Aiming at solving the characterization and optimization of information freshness in the sixth generation (6G) communication system, we firstly model information freshness based on the age of information (AoI) in the unmanned aerial vehicle (UAV) communication system and formulate an AoI minimization problem subjected to the energy consumption. However, the nonconvex problem is difficult to solve due to discreteness of AoI optimization and the complicated energy consumption expression. A reinforcement learning-based scheme is proposed to design the UAV's trajectory, in which the reward function related to AoI is constructed to realize a fast and intelligent UAV trajectory decision, thus reducing the AoI of UAV communication system. The simulation results show that, compared with the benchmark schemes, the proposed trajectory design scheme can improve the information freshness by 8.51%~21.82%. In addition, the proposed scheme has a superior convergence. Copyright ©2022 Journal of University of Electronic Science and Technology of China. All rights reserved.
引用
收藏
页码:213 / 218
页数:5
相关论文
共 20 条
  • [1] 6G white paper: Key drivers and research challenges for 6G ubiquitous wireless intelligence
  • [2] JIA Z Y, SHENG M, LI J D, Et al., LEO satellite-assisted UAV: Joint trajectory and data collection for internet of remote things in 6G aerial access networks, IEEE Internet of Things Journal, 8, 12, pp. 9814-9826, (2020)
  • [3] AGGARWAL S, KUMAR N, TANWAR S., Blockchain-envisioned UAV communication using 6G networks: Open issues, use cases, and future directions, IEEE Internet of Things Journal, 8, 7, pp. 5416-5441, (2021)
  • [4] JIANG X, YANG Z, ZHAO N, Et al., Resource allocation and trajectory optimization for UAV-enabled multi-user covert communications, IEEE Transactions on Vehicular Technology, 69, 7, pp. 7634-7647, (2021)
  • [5] ZENG Y, XU X, JIN S, Et al., Simultaneous navigation and radio mapping for cellular-connected UAV with deep reinforcement learning, IEEE Transactions on Wireless Communications, 20, 7, pp. 4205-4220, (2021)
  • [6] XU J, ZENG Y, ZHANG R., UAV-enabled wireless power transfer: Trajectory design and energy region characterization, IEEE Global Communications Conference, pp. 1-7, (2017)
  • [7] WANG J, XIAO L, YANG D C, Et al., Joint trajectory optimization and power control in UAV-to-ground communication, Modern Electronic Technology, 43, 17, pp. 6-10, (2020)
  • [8] HUANG Y Q, CUI M, ZHANG G C., Bandwidth, power and trajectory optimization for multiple UAV base stations with backhaul constraint, Application Research of Computers, 38, 3, pp. 831-840, (2021)
  • [9] LI X M, XU J., Positioning optimization for sum-rate maximization in UAV-enabled interference channel, IEEE Signal Processing Letters, 26, 10, pp. 1466-1470, (2019)
  • [10] BAYERLEIN H, KERRET P D, GESBERT D., Trajectory optimization for autonomous flying base station via reinforcement learning, IEEE International Workshop on Signal Processing Advances in Wireless Communications, pp. 1-5, (2018)