Optimized Deep Reinforcement Learning Approach for Dynamic System

被引:8
|
作者
Tan, Ziya [1 ]
Karakose, Mehmet [2 ]
机构
[1] Erzincan Binali Yildirim Univ, Erzincan, Turkey
[2] Firat Univ, Elazig, Turkey
关键词
CartPole; Deep Reinforcement Learning; Deep Q-Learning;
D O I
10.1109/isse49799.2020.9272245
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Reinforcement learning methods provide significant and impressive improvements in artificial intelligence studies in recent years, especially in Atari and Go games. This development attracts the attention of scientists who try to understand how people learn. In addition, the biggest advantage of reinforcement learning over other learning algorithms is that it does not require any prior data. This feature distinguishes Reinforcement Learning from others. Reinforcement Learning is an approach in which smart programs work in a certain or uncertain environment to constantly adapt and learn based on scoring. The feedback process is also known as a reward or called a penalty. Given Agents and environment, it is determined which action to take. In this study, we apply the success of Deep Q-Learning (DQL) algorithm, one of the model-free based deep reinforcement learning algorithms used in the literature, on the CartPole problem. We also offer a different method to improve the agent's success during the training phase. We present an optimized DQL algorithm using a function that updates the weights of the neural network at every step.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Dynamic VNF Scheduling: A Deep Reinforcement Learning Approach
    Zhang, Zixiao
    He, Fujun
    Oki, Eiji
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 2023, E106B (07) : 557 - 570
  • [2] Teleconsultation dynamic scheduling with a deep reinforcement learning approach
    Chen, Wenjia
    Li, Jinlin
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 149
  • [3] A Deep Reinforcement Learning Approach for Dynamic Contents Caching in HetNets
    Ma, Manyou
    Wong, Vincent W. S.
    [J]. ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [4] Dynamic Offloading Loading Optimization in Distributed Fault Diagnosis System with Deep Reinforcement Learning Approach
    Yu, Liang
    Guo, Qixin
    Wang, Rui
    Shi, Minyan
    Yan, Fucheng
    Wang, Ran
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (07):
  • [5] A Hybrid Deep Reinforcement Learning Approach for Dynamic Task Offloading in NOMA-MEC System
    Shang, Ce
    Sun, Yan
    Luo, Hong
    [J]. 2022 19TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING (SECON), 2022, : 434 - 442
  • [6] Dynamic Pricing for EV Charging Stations: A Deep Reinforcement Learning Approach
    Zhao, Zhonghao
    Lee, Carman K. M.
    [J]. IEEE TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION, 2022, 8 (02) : 2456 - 2468
  • [7] UAV navigation in high dynamic environments: A deep reinforcement learning approach
    Guo, Tong
    Jiang, Nan
    Li, Biyue
    Zhu, Xi
    Wang, Ya
    Du, Wenbo
    [J]. CHINESE JOURNAL OF AERONAUTICS, 2021, 34 (02) : 479 - 489
  • [8] A Deep Reinforcement Learning Approach to Fair Distributed Dynamic Spectrum Access
    Jalil, Syed Qaisar
    Rehmani, Mubashir Husain
    Chalup, Stephan
    [J]. PROCEEDINGS OF THE 17TH EAI INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES (MOBIQUITOUS 2020), 2021, : 236 - 244
  • [9] UAV navigation in high dynamic environments:A deep reinforcement learning approach
    Tong GUO
    Nan JIANG
    Biyue LI
    Xi ZHU
    Ya WANG
    Wenbo DU
    [J]. Chinese Journal of Aeronautics, 2021, 34 (02) : 479 - 489
  • [10] A dynamic penalty approach to state constraint handling in deep reinforcement learning
    Yoo, Haeun
    Zavala, Victor M.
    Lee, Jay H.
    [J]. JOURNAL OF PROCESS CONTROL, 2022, 115 : 157 - 166