Action-Dependent Heuristic Dynamic Programming With Experience Replay for Wastewater Treatment Processes

被引:5
|
作者
Qiao, Junfei [1 ,2 ]
Zhao, Mingming [1 ,2 ]
Wang, Ding [1 ,2 ]
Li, Menghua [1 ,2 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
[2] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
关键词
Action-dependent heuristic dynamic programming (ADHDP); adaptive critic control; adaptive dynamic programming (ADP); tracking control; wastewater treatment applications; DISSOLVED-OXYGEN CONTROL; OPTIMAL TRACKING;
D O I
10.1109/TII.2023.3344130
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The wastewater treatment process (WWTP) is beneficial for maintaining sufficient water resources and recycling wastewater. A crucial link of WWTP is to ensure that the dissolved oxygen (DO) concentration is continuously maintained at the predetermined value, which can actually be considered as a tracking problem. In this article, an experience replay-based action-dependent heuristic dynamic programming (ER-ADHDP) method is developed to design the model-free tracking controller to accomplish the tracking goal of the DO concentration. First, the online ER-ADHDP controller is regarded as a supplementary controller to conduct the model-free tracking control alongside a stabilizing controller with a priori knowledge. The online ER-ADHDP method can adaptively adjust weight parameters of critic and action networks, thereby continuously ameliorating the tracking result over time. Second, the ER technique is integrated into the critic and action networks to promote the data utilization efficiency and accelerate the learning process. Third, a rational stability result is provided to theoretically ensure the usefulness of the ER-ADHDP tracking design. Finally, simulation experiments including different reference trajectories are conducted to show the superb tracking performance and excellent adaptability of the proposed ER-ADHDP method.
引用
收藏
页码:6257 / 6265
页数:9
相关论文
共 50 条
  • [11] Thalamic cooperation between the cerebellum and basal ganglia with a new tropism-based action-dependent heuristic dynamic programming method
    Ruan, Xiaogang
    Chen, Jing
    Yu, Naigong
    NEUROCOMPUTING, 2012, 93 : 27 - 40
  • [12] Convergence and numerical stability of action-dependent heuristic dynamic programming algorithms based on RLS learning for online DLQR optimal control
    de Sousa, Guilherme Bonfim
    Moraes Rego, Patricia Helena
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2019, 20 (03) : 317 - 334
  • [13] Dynamic bargaining with action-dependent valuations
    Lemke, RJ
    JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2004, 28 (09): : 1847 - 1875
  • [14] Prioritizing Useful Experience Replay for Heuristic Dynamic Programming-Based Learning Systems
    Ni, Zhen
    Malla, Naresh
    Zhong, Xiangnan
    IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (11) : 3911 - 3922
  • [15] Reinforcement control via action dependent heuristic dynamic programming
    Tang, KW
    Srikant, G
    1997 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, 1997, : 1766 - 1770
  • [16] Application of heuristic dynamic programming to wastewater treatment process control
    Bo, Y.-C. (boyingchun@sina.com.cn), 2013, South China University of Technology (30):
  • [17] Supplementary heuristic dynamic programming for wastewater treatment process control
    Wang, Ding
    Li, Xin
    Xin, Peng
    Liu, Ao
    Qiao, Junfei
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 247
  • [18] Action Dependent Dual Heuristic Programming Solution for the Dynamic Graphical Games
    Abouheaf, Mohammed I.
    Lewis, Frank L.
    Mahmoud, Magdi S.
    2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 2741 - 2746
  • [19] Action dependent heuristic dynamic programming for home energy resource scheduling
    Fuselli, Danilo
    De Angelis, Francesco
    Boaro, Matteo
    Squartini, Stefano
    Wei, Qinglai
    Liu, Derong
    Piazza, Francesco
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2013, 48 : 148 - 160
  • [20] Renewable Energy Management Using Action Dependent Heuristic Dynamic Programming
    Sterling, Gulnaz
    Tyler, Benjamin
    2018 IEEE INTERNATIONAL SMART CITIES CONFERENCE (ISC2), 2018,