Data-Driven Learning and Control with Multiple Critic Networks

被引:0
|
作者
He, Haibo [1 ]
Ni, Zhen [1 ]
Zhao, Dongbin [2 ]
机构
[1] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA
[2] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
adaptive dynamic programming (ADP); multiple critic networks; external reinforcement signal; internal reinforcement signal; goal representation; hierarchical structure; TIME NONLINEAR-SYSTEMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we extend our previous work of a three-network adaptive dynamic programming design [1] to be a multiple critic networks design for online learning and control. The key idea of this approach is to develop a hierarchical internal goal representation to facilitate the online learning with detailed and informative internal value signal representations. We present our learning architecture in detail, and also demonstrate its performance on the popular cart-pole balancing benchmark. Simulation results demonstrate the effectiveness of our approach. We also present discussions of further research directions along this topic.
引用
收藏
页码:523 / 527
页数:5
相关论文
共 50 条
  • [31] Data-driven Adaptive Iterative Learning Predictive Control
    Lv, Yunkai
    Chi, Ronghu
    [J]. 2017 6TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS (DDCLS), 2017, : 374 - 377
  • [32] Data-Driven Control of Hydraulic Manipulators by Reinforcement Learning
    Yao, Zhikai
    Xu, Fengyu
    Jiang, Guo-Ping
    Yao, Jianyong
    [J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024, 29 (04) : 2673 - 2684
  • [33] Data-Driven Iterative Adaptive Critic Control Toward an Urban Wastewater Treatment Plant
    Wang, Ding
    Ha, Mingming
    Qiao, Junfei
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, 68 (08) : 7362 - 7369
  • [34] Data-Driven Resource Allocation for Deep Learning in IoT Networks
    Chun, Chang-Jae
    Jeong, Cheol
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (02) : 2082 - 2096
  • [35] Data-driven initialization and structure learning in fuzzy neural networks
    Setnes, M
    Koene, A
    Babuska, R
    Bruijn, PM
    [J]. 1998 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AT THE IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE - PROCEEDINGS, VOL 1-2, 1998, : 1147 - 1152
  • [36] Multiple Affordances of Language Corpora in Data-driven Learning.
    Smart, Jon
    [J]. Corpora, 2016, 11 (03) : 465 - 468
  • [37] Data-Driven Learning Control for Stochastic Nonlinear Systems: Multiple Communication Constraints and Limited Storage
    Shen, Dong
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2429 - 2440
  • [38] A Data-driven Optimal Iterative Learning Control with Data Loss Compensation
    Lv, Yunkai
    Chi, Ronghu
    Lin, Na
    [J]. PROCEEDINGS OF 2018 IEEE 7TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS), 2018, : 180 - 183
  • [39] Data-Driven Learning for H∞ Control of Adaptive Cruise Control Systems
    Zhao, Jun
    Wang, Zhangu
    Lv, Yongfeng
    Na, Jing
    Liu, Congzhi
    Zhao, Ziliang
    [J]. IEEE Transactions on Vehicular Technology, 2024, 73 (12): : 18348 - 18362
  • [40] Data-driven identifier–actor–critic learning for cooperative spacecraft attitude tracking with orientation constraints
    Xia, Kewei
    Wang, Jianan
    Zou, Yao
    Gao, Hongbo
    Ding, Zhengtao
    [J]. Automatica, 2025, 173