Data-driven dynamic multi-objective optimal control: A Hamiltonian-inequality driven satisficing reinforcement learning approach

Cited by: 0
Authors
Mazouchi, Majid [1]
Yang, Yongliang [2]
Modares, Hamidreza [1]
Affiliations
[1] Michigan State Univ, Dept Mech Engn, E Lansing, MI 48824 USA
[2] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 10083, Peoples R China
Source
IFAC PAPERSONLINE | 2020 / Vol. 53 / Issue 02
Keywords
Multi-objective optimization; Pareto optimality; Reinforcement learning; Sum-of-Square theory; FEEDBACK-CONTROL;
DOI
10.1016/j.ifacol.2020.12.2275
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper presents an iterative data-driven algorithm for solving dynamic multi-objective (MO) optimal control problems arising in the control of nonlinear continuous-time systems with multiple objectives. It is first shown that the Hamiltonian function corresponding to each objective can serve as a comparison function for the performance of admissible policies. Relaxed Hamilton-Jacobi-Bellman (HJB) equations, expressed as HJB inequalities, are then solved in a dynamic constrained MO framework to find Pareto-optimal solutions. The relation to the satisficing (good enough) decision-making framework is shown. A Sum-of-Squares (SOS)-based iterative algorithm is developed to solve the formulated MO optimization with HJB inequalities. To obviate the requirement of complete knowledge of the system dynamics, a data-driven satisficing reinforcement learning approach is proposed that solves the SOS optimization problem in real time using only the system trajectories measured during a time interval. Finally, a simulation example is provided to show the effectiveness of the proposed algorithm. Copyright (C) 2020 The Authors.
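As a reading aid, the relaxed-HJB idea mentioned in the abstract can be sketched in a standard input-affine setting. The notation below (dynamics f and g, state costs Q_i, input weights R_i, candidate value functions V_i) is an assumption chosen for illustration and is not taken from the paper itself, whose formulation may differ.

```latex
% Assumed setting (illustrative, not quoted from the paper):
% input-affine dynamics and N quadratic-in-control objectives.
\dot{x} = f(x) + g(x)\,u, \qquad
J_i(x_0, u) = \int_0^{\infty} \bigl( Q_i(x) + u^{\top} R_i\, u \bigr)\, dt,
\quad i = 1, \dots, N .

% Hamiltonian of objective i for a candidate value function V_i:
H_i(V_i, u) = \nabla V_i(x)^{\top} \bigl( f(x) + g(x)\,u \bigr)
              + Q_i(x) + u^{\top} R_i\, u .

% The HJB equation requires \min_u H_i(V_i^{*}, u) = 0.
% The relaxation replaces the equality by an inequality:
H_i(V_i, u) \le 0 \quad \text{for all } x .

% Why the inequality is useful: along closed-loop trajectories,
% \dot{V}_i = \nabla V_i^{\top}(f + g u) \le -\bigl(Q_i + u^{\top} R_i u\bigr).
% Integrating from 0 to \infty, with V_i \ge 0, V_i(0) = 0 and x(t) \to 0, gives
J_i(x_0, u) \le V_i(x_0) .
```

Under these assumptions, any V_i satisfying the inequality upper-bounds the cost of the policy u for objective i, which is the sense in which the Hamiltonian can act as a comparison function. When f, g, Q_i and V_i are polynomial, the nonpositivity condition can be posed as a Sum-of-Squares constraint on -H_i.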
Pages: 8070 - 8075
Number of pages: 6
Related papers
50 in total
  • [1] A Data-Driven Reinforcement Learning Based Multi-Objective Route Recommendation System
    Sarker, Ankur
    Shen, Haiying
    Kowsari, Kamran
    2020 IEEE 17TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SMART SYSTEMS (MASS 2020), 2020, : 103 - 111
  • [2] Data-Driven Dynamic Multiobjective Optimal Control: An Aspiration-Satisfying Reinforcement Learning Approach
    Mazouchi, Majid
    Yang, Yongliang
    Modares, Hamidreza
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) : 6183 - 6193
  • [3] Data-driven multi-objective intelligent optimal control of municipal solid waste incineration process
    Wang, Tianzheng
    Tang, Jian
    Xia, Heng
    Yang, Cuili
    Yu, Wen
    Qiao, Junfei
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 137
  • [4] Data-Driven Reinforcement Learning for Optimal Motor Control in Washing Machines
    Kang, Chanseok
    Bae, Guntae
    Kim, Daesung
    Lee, Kyoungwoo
    Son, Dohyeon
    Lee, Chul
    Lee, Jaeho
    Lee, Jinwoo
    Yun, Jae Woong
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 418 - 424
  • [5] Norm Optimal Iterative Learning Control: A Data-Driven Approach
    Jiang, Zheng
    Chu, Bing
    IFAC PAPERSONLINE, 2022, 55 (12) : 482 - 487
  • [6] Data-Driven Solutions to Mixed H2/H∞ Control: A Hamilton-Inequality-Driven Reinforcement Learning Approach
    Yang, Yongliang
    Mazouchi, Majid
    Modares, Hamidreza
    2020 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS (CCTA), 2020, : 340 - 345
  • [7] MORL4PDEs: Data-driven discovery of PDEs based on multi-objective optimization and reinforcement learning
    Zhang, Xiaoxia
    Guan, Junsheng
    Liu, Yanjun
    Wang, Guoyin
    CHAOS SOLITONS & FRACTALS, 2024, 180
  • [8] Data-driven dynamic relatively optimal control
    Pellegrino, Felice A.
    Blanchini, Franco
    Fenu, Gianfranco
    Salvato, Erica
    EUROPEAN JOURNAL OF CONTROL, 2023, 74
  • [9] Multi-objective unit commitment under hybrid uncertainties: A data-driven approach
    Zhou, Min
    Wang, Bo
    Li, Tian-tian
    Watada, Junzo
    2018 IEEE 15TH INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL (ICNSC), 2018,
  • [10] A data-driven approach for multi-objective unit commitment under hybrid uncertainties
    Zhou, Min
    Wang, Bo
    Li, Tiantian
    Watada, Junzo
    ENERGY, 2018, 164 : 722 - 733