Data-driven dynamic multi-objective optimal control: A Hamiltonian-inequality driven satisficing reinforcement learning approach

被引:0
|
作者
Mazouchi, Majid [1 ]
Yang, Yongliang [2 ]
Modares, Hamidreza [1 ]
机构
[1] Michigan State Univ, Dept Mech Engn, E Lansing, MI 48824 USA
[2] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 10083, Peoples R China
来源
IFAC PAPERSONLINE | 2020年 / 53卷 / 02期
关键词
Multi-objective optimization; Pareto optimality; Reinforcement learning; Sum-of-Square theory; FEEDBACK-CONTROL;
D O I
10.1016/j.ifacol.2020.12.2275
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents an iterative data-driven algorithm for solving dynamic multi-objective (MO) optimal control problems arising in control of nonlinear continuous-time systems with multiple objectives. It is first shown that the Hamiltonian function corresponding to each objective can serve as a comparison function to compare the performance of admissible policies. Relaxed Hamilton-Jacobi-bellman (HJB) equations in terms of HJB inequalities are then solved in a dynamic constrained MO framework to find Pareto-optimal solutions. Relation to satisficing (good enough) decision-making framework is shown. A Sum-of-Square (SOS)-based iterative algorithm is developed to solve the formulated MO optimization with HJB inequalities. To obviate the requirement of complete knowledge of the system dynamics, a data-driven satisficing reinforcement learning approach is proposed to solve the SOS optimization problem in real-time using only the information of the system trajectories measured during a time interval without having full knowledge of the system dynamics. Finally, a simulation example is provided to show the effectiveness of the proposed algorithm. Copyright (C) 2020 The Authors.
引用
收藏
页码:8070 / 8075
页数:6
相关论文
共 50 条
  • [31] Data-Driven Product Prediction and Multi-Objective Optimal Operations of Wax Oil Hydrotreating Unit
    Tian, Shuimiao
    Cao, Cuiwen
    Shiyou Xuebao, Shiyou Jiagong/Acta Petrolei Sinica (Petroleum Processing Section), 2021, 37 (01): : 79 - 87
  • [32] DATA-DRIVEN ROBUST MULTI-AGENT REINFORCEMENT LEARNING
    Wang, Yudan
    Wang, Yue
    Zhou, Yi
    Velasquez, Alvaro
    Zou, Shaofeng
    2022 IEEE 32ND INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2022,
  • [33] Data-driven dynamic resource scheduling for network slicing: A Deep reinforcement learning approach
    Wang, Haozhe
    Wu, Yulei
    Min, Geyong
    Xu, Jie
    Tang, Pengcheng
    INFORMATION SCIENCES, 2019, 498 : 106 - 116
  • [34] Constrained data-driven optimal iterative learning control
    Chi, Ronghu
    Liu, Xiaohe
    Zhang, Ruikun
    Hou, Zhongsheng
    Huang, Biao
    JOURNAL OF PROCESS CONTROL, 2017, 55 : 10 - 29
  • [35] Data-driven optimal terminal iterative learning control
    Chi, Ronghu
    Wang, Danwei
    Hou, Zhongsheng
    Jin, Shangtai
    JOURNAL OF PROCESS CONTROL, 2012, 22 (10) : 2026 - 2037
  • [36] On a Probabilistic Approach for Inverse Data-Driven Optimal Control
    Garrabe, Emiland
    Jesawada, Hozefa
    Del Vecchio, Carmen
    Russo, Giovanni
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 4411 - 4416
  • [37] On the Performance of Data-Driven Reinforcement Learning for Commercial HVAC Control
    Faddel, Samy
    Tian, Guanyu
    Zhou, Qun
    Aburub, Haneen
    2020 IEEE INDUSTRY APPLICATIONS SOCIETY ANNUAL MEETING, 2020,
  • [38] Safe Reinforcement Learning using Data-Driven Predictive Control
    Selim, Mahmoud
    Alanwar, Amr
    El-Kharashi, M. Watheq
    Abbas, Hazem M.
    Johansson, Karl H.
    2022 5TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, SIGNAL PROCESSING, AND THEIR APPLICATIONS (ICCSPA), 2022,
  • [39] Multi-Objective Markov Decision Processes for Data-Driven Decision Support
    Lizotte, Daniel J.
    Laber, Eric B.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [40] Multi-objective optimization of hydrocyclone by combining mechanistic and data-driven models
    Ye, Qing
    Duan, Peibo
    Kuang, Shibo
    Ji, Li
    Zou, Ruiping
    Yu, Aibing
    POWDER TECHNOLOGY, 2022, 407