Efficient Massive-Device Orchestration Through Reinforcement Learning With Boosted Deep Deterministic Policy Gradient

被引:0
|
作者
Shi, Haowei [1 ]
Zou, Jiadao [1 ]
Zhang, Qingxue [1 ,2 ]
机构
[1] Purdue Sch Engn & Technol, Dept Elect & Comp Engn, Indianapolis, IN 46202 USA
[2] Purdue Sch Engn & Technol, Dept Biomed Engn, Indianapolis, IN 46202 USA
关键词
Big data; deep deterministic policy gradient (DDPG); Internet of Things (IoT); system configuration; wearable computer;
D O I
10.1109/JIOT.2023.3301795
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Big data innovations are boosted by massive devices that capture a large amount of dynamics from human or environment and further mine the insights hidden in the dynamics. However, the challenge arises in the complex massive-device orchestration, meaning that it is essential to configure and manage the massive devices and the gateway/server. The complexity, on the massive wearable or Internet of Things devices, lies in the diverse energy budget, computing efficiency, and communication channel conditions. On the phone or server side, it lies in how the global diversity can be analyzed and how the system configuration can be optimized. Targeting this obstacle, we propose a new reinforcement learning architecture, called boosted deep deterministic policy gradient, with enhanced actor-critic co-learning and multiview state-transformation. More specifically, the proposed actor-critic co-learning allows for enhanced dynamics abstraction through the shared neural network component. Further, the state-transformation, with multiple parallel learning agents, greatly boosts the action quality and learning process. Evaluated on complex massive-device orchestration tasks, the proposed deep reinforcement learning framework has achieved much more efficient system configurations with enhanced computing capabilities and energy efficiency. This study will greatly advance massive-device system configuration through deep learning and reinforcement rewarding mechanisms, toward efficient big data practices.
引用
收藏
页码:5143 / 5154
页数:12
相关论文
共 50 条
  • [1] Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm
    Wu, Junta
    Li, Huiyun
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [2] Generative Adversarial Inverse Reinforcement Learning With Deep Deterministic Policy Gradient
    Zhan, Ming
    Fan, Jingjing
    Guo, Jianying
    [J]. IEEE ACCESS, 2023, 11 : 87732 - 87746
  • [3] An efficient and robust gradient reinforcement learning: Deep comparative policy
    Wang, Jiaguo
    Li, Wenheng
    Lei, Chao
    Yang, Meng
    Pei, Yang
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (02) : 3773 - 3788
  • [4] Strategy Generation Based on Reinforcement Learning with Deep Deterministic Policy Gradient for UCAV
    Ma, Yunhong
    Bai, Shuyao
    Zhao, Yifei
    Song, Chao
    Yang, Jie
    [J]. 16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020), 2020, : 789 - 794
  • [5] Reinforcement Learning for Mobile Robot Obstacle Avoidance with Deep Deterministic Policy Gradient
    Chen, Miao
    Li, Wenna
    Fei, Shihan
    Wei, Yufei
    Tu, Mingyang
    Li, Jiangbo
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT III, 2022, 13457 : 197 - 204
  • [6] Mutual Deep Deterministic Policy Gradient Learning
    Sun, Zhou
    [J]. 2022 INTERNATIONAL CONFERENCE ON BIG DATA, INFORMATION AND COMPUTER NETWORK (BDICN 2022), 2022, : 508 - 513
  • [7] Reinforcement Learning Control with Deep Deterministic Policy Gradient Algorithm for Multivariable pH Process
    Panjapornpon, Chanin
    Chinchalongporn, Patcharapol
    Bardeeniz, Santi
    Makkayatorn, Ratthanita
    Wongpunnawat, Witchaya
    [J]. PROCESSES, 2022, 10 (12)
  • [8] Deep deterministic policy gradient to regulate feedback control systems using reinforcement learning
    Arshad, Jehangir
    Khan, Ayesha
    Aftab, Mariam
    Hussain, Mujtaba
    Rehman, Ateeq Ur
    Ahmad, Shafiq
    Al-Shayea, Adel M.
    Shafiq, Muhammad
    [J]. Computers, Materials and Continua, 2022, 71 (01): : 1153 - 1169
  • [9] Improvement of PMSM Control Using Reinforcement Learning Deep Deterministic Policy Gradient Agent
    Nicola, Marcel
    Nicola, Claudiu-Ionel
    [J]. 2021 21ST INTERNATIONAL SYMPOSIUM ON POWER ELECTRONICS (EE 2021), 2021,
  • [10] Independent Deep Deterministic Policy Gradient Reinforcement Learning in Cooperative Multiagent Pursuit Games
    Zhou, Shiyang
    Ren, Weiya
    Ren, Xiaoguang
    Wang, Yanzhen
    Yi, Xiaodong
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 625 - 637