Efficient Massive-Device Orchestration Through Reinforcement Learning With Boosted Deep Deterministic Policy Gradient

Cited by: 0
Authors
Shi, Haowei [1]
Zou, Jiadao [1]
Zhang, Qingxue [1, 2]
Affiliations
[1] Purdue Sch Engn & Technol, Dept Elect & Comp Engn, Indianapolis, IN 46202 USA
[2] Purdue Sch Engn & Technol, Dept Biomed Engn, Indianapolis, IN 46202 USA
Keywords
Big data; deep deterministic policy gradient (DDPG); Internet of Things (IoT); system configuration; wearable computer
DOI
10.1109/JIOT.2023.3301795
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Big data innovations are boosted by massive devices that capture a large amount of dynamics from human or environment and further mine the insights hidden in the dynamics. However, the challenge arises in the complex massive-device orchestration, meaning that it is essential to configure and manage the massive devices and the gateway/server. The complexity, on the massive wearable or Internet of Things devices, lies in the diverse energy budget, computing efficiency, and communication channel conditions. On the phone or server side, it lies in how the global diversity can be analyzed and how the system configuration can be optimized. Targeting this obstacle, we propose a new reinforcement learning architecture, called boosted deep deterministic policy gradient, with enhanced actor-critic co-learning and multiview state-transformation. More specifically, the proposed actor-critic co-learning allows for enhanced dynamics abstraction through the shared neural network component. Further, the state-transformation, with multiple parallel learning agents, greatly boosts the action quality and learning process. Evaluated on complex massive-device orchestration tasks, the proposed deep reinforcement learning framework has achieved much more efficient system configurations with enhanced computing capabilities and energy efficiency. This study will greatly advance massive-device system configuration through deep learning and reinforcement rewarding mechanisms, toward efficient big data practices.
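The abstract names two ideas: an actor and a critic that share a neural network component, and several parallel agents that each work from a transformed view of the state. The record gives no architectural detail, so the following is only a minimal forward-pass sketch under assumptions: the class `SharedActorCritic`, the three transformations in `VIEWS`, the layer sizes, and the rule of keeping the proposal that one critic scores highest are all hypothetical illustrations, not the authors' method. Training (replay buffer, target networks, gradient updates) is omitted.

```python
import math
import random


def linear(weights, bias, x):
    """Dense layer: one output per weight row."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def init(rows, cols, rng):
    """Small random weight matrix (rows x cols)."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


class SharedActorCritic:
    """Actor and critic heads on top of one shared trunk (forward pass only)."""

    def __init__(self, state_dim, action_dim, hidden=8, seed=0):
        rng = random.Random(seed)
        self.trunk_w, self.trunk_b = init(hidden, state_dim, rng), [0.0] * hidden
        self.actor_w, self.actor_b = init(action_dim, hidden, rng), [0.0] * action_dim
        # The critic head sees the shared features concatenated with the action.
        self.critic_w, self.critic_b = init(1, hidden + action_dim, rng), [0.0]

    def features(self, state):
        """Shared representation used by both heads."""
        return [math.tanh(h) for h in linear(self.trunk_w, self.trunk_b, state)]

    def act(self, state):
        """Deterministic policy: action components bounded to [-1, 1]."""
        return [math.tanh(a) for a in linear(self.actor_w, self.actor_b, self.features(state))]

    def q_value(self, state, action):
        """Scalar value estimate for a state-action pair."""
        return linear(self.critic_w, self.critic_b, self.features(state) + list(action))[0]


# Hypothetical state views; the source does not say which transformations are used.
VIEWS = [
    lambda s: list(s),               # raw state
    lambda s: [2.0 * v for v in s],  # rescaled state
    lambda s: list(reversed(s)),     # reordered state
]


def select_action(agents, state):
    """Each agent proposes an action from its own view; keep the proposal
    that the first agent's critic scores highest on the raw state."""
    proposals = [agent.act(view(state)) for agent, view in zip(agents, VIEWS)]
    return max(proposals, key=lambda a: agents[0].q_value(state, a))


agents = [SharedActorCritic(state_dim=3, action_dim=2, seed=i) for i in range(len(VIEWS))]
state = [0.2, -0.4, 0.9]  # illustrative: normalized energy, compute load, channel quality
action = select_action(agents, state)
```

Sharing the trunk means both heads read the same features, which is one plausible reading of "enhanced dynamics abstraction through the shared neural network component"; the selection rule is only one of several ways parallel proposals could be combined.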
Pages: 5143-5154
Page count: 12
Related papers
50 in total
  • [21] Deep reinforcement learning for PMSG wind turbine control via twin delayed deep deterministic policy gradient (TD3)
    Zholtayev, Darkhan
    Rubagotti, Matteo
    Do, Ton Duc
    [J]. OPTIMAL CONTROL APPLICATIONS & METHODS, 2024, 45 (04): 1889-1906
  • [22] Active control of flexible rotors using deep reinforcement learning with application of multi-actor-critic deep deterministic policy gradient
    Ahmed, Maheed H.
    AboHussien, Abdullah
    El-Shafei, Aly
    Darwish, Ahmed M.
    Abdel-Gawad, Ahmed H.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 124
  • [23] A Self-Adaptive Vibration Reduction Method Based on Deep Deterministic Policy Gradient (DDPG) Reinforcement Learning Algorithm
    Jin, Xin
    Ma, Hongbao
    Kang, Yihua
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (19)
  • [24] Reinforcement learning control method for real-time hybrid simulation based on deep deterministic policy gradient algorithm
    Li, Ning
    Tang, Jichuan
    Li, Zhong-Xian
    Gao, Xiuyu
    [J]. Structural Control and Health Monitoring, 2022, 29 (10)
  • [26] Composite deep learning control for autonomous bicycles by using deep deterministic policy gradient
    He, Kanghui
    Dong, Chaoyang
    Yan, An
    Zheng, Qingyuan
    Liang, Bin
    Wang, Qing
    [J]. IECON 2020: THE 46TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2020: 2766-2773
  • [27] Applicability of Deep Reinforcement Learning for Efficient Federated Learning in Massive IoT Communications
    Tam, Prohim
    Corrado, Riccardo
    Eang, Chanthol
    Kim, Seokhoon
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (05)
  • [28] Efficient Path Planning for Mobile Robot Based on Deep Deterministic Policy Gradient
    Gong, Hui
    Wang, Peng
    Ni, Cui
    Cheng, Nuo
    [J]. SENSORS, 2022, 22 (09)
  • [29] Efficient Novelty Search Through Deep Reinforcement Learning
    Shi, Longxiang
    Li, Shijian
    Zheng, Qian
    Yao, Min
    Pan, Gang
    [J]. IEEE ACCESS, 2020, 8: 128809-128818
  • [30] Policy ensemble gradient for continuous control problems in deep reinforcement learning
    Liu, Guoqiang
    Chen, Gang
    Huang, Victoria
    [J]. NEUROCOMPUTING, 2023, 548