Data-driven dynamic multi-objective optimal control: A Hamiltonian-inequality driven satisficing reinforcement learning approach

被引：0

作者：

Mazouchi, Majid ^{[1
]}

Yang, Yongliang ^{[2
]}

Modares, Hamidreza ^{[1
]}

机构：

[1] Michigan State Univ, Dept Mech Engn, E Lansing, MI 48824 USA

[2] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 10083, Peoples R China

来源：

IFAC PAPERSONLINE | 2020年 / 53卷 / 02期

关键词：

Multi-objective optimization; Pareto optimality; Reinforcement learning; Sum-of-Square theory; FEEDBACK-CONTROL;

D O I：

10.1016/j.ifacol.2020.12.2275

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents an iterative data-driven algorithm for solving dynamic multi-objective (MO) optimal control problems arising in control of nonlinear continuous-time systems with multiple objectives. It is first shown that the Hamiltonian function corresponding to each objective can serve as a comparison function to compare the performance of admissible policies. Relaxed Hamilton-Jacobi-bellman (HJB) equations in terms of HJB inequalities are then solved in a dynamic constrained MO framework to find Pareto-optimal solutions. Relation to satisficing (good enough) decision-making framework is shown. A Sum-of-Square (SOS)-based iterative algorithm is developed to solve the formulated MO optimization with HJB inequalities. To obviate the requirement of complete knowledge of the system dynamics, a data-driven satisficing reinforcement learning approach is proposed to solve the SOS optimization problem in real-time using only the information of the system trajectories measured during a time interval without having full knowledge of the system dynamics. Finally, a simulation example is provided to show the effectiveness of the proposed algorithm. Copyright (C) 2020 The Authors.

引用

页码：8070 / 8075

页数：6

共 50 条

[31] Data-Driven Product Prediction and Multi-Objective Optimal Operations of Wax Oil Hydrotreating Unit
Tian, Shuimiao
Cao, Cuiwen
Shiyou Xuebao, Shiyou Jiagong/Acta Petrolei Sinica (Petroleum Processing Section), 2021, 37 (01): : 79 - 87
[32] DATA-DRIVEN ROBUST MULTI-AGENT REINFORCEMENT LEARNING
Wang, Yudan
Wang, Yue
Zhou, Yi
Velasquez, Alvaro
Zou, Shaofeng
2022 IEEE 32ND INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2022,
[33] Data-driven dynamic resource scheduling for network slicing: A Deep reinforcement learning approach
Wang, Haozhe
Wu, Yulei
Min, Geyong
Xu, Jie
Tang, Pengcheng
INFORMATION SCIENCES, 2019, 498 : 106 - 116
[34] Constrained data-driven optimal iterative learning control
Chi, Ronghu
Liu, Xiaohe
Zhang, Ruikun
Hou, Zhongsheng
Huang, Biao
JOURNAL OF PROCESS CONTROL, 2017, 55 : 10 - 29
[35] Data-driven optimal terminal iterative learning control
Chi, Ronghu
Wang, Danwei
Hou, Zhongsheng
Jin, Shangtai
JOURNAL OF PROCESS CONTROL, 2012, 22 (10) : 2026 - 2037
[36] On a Probabilistic Approach for Inverse Data-Driven Optimal Control
Garrabe, Emiland
Jesawada, Hozefa
Del Vecchio, Carmen
Russo, Giovanni
2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 4411 - 4416
[37] On the Performance of Data-Driven Reinforcement Learning for Commercial HVAC Control
Faddel, Samy
Tian, Guanyu
Zhou, Qun
Aburub, Haneen
2020 IEEE INDUSTRY APPLICATIONS SOCIETY ANNUAL MEETING, 2020,
[38] Safe Reinforcement Learning using Data-Driven Predictive Control
Selim, Mahmoud
Alanwar, Amr
El-Kharashi, M. Watheq
Abbas, Hazem M.
Johansson, Karl H.
2022 5TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, SIGNAL PROCESSING, AND THEIR APPLICATIONS (ICCSPA), 2022,
[39] Multi-Objective Markov Decision Processes for Data-Driven Decision Support
Lizotte, Daniel J.
Laber, Eric B.
JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
[40] Multi-objective optimization of hydrocyclone by combining mechanistic and data-driven models
Ye, Qing
Duan, Peibo
Kuang, Shibo
Ji, Li
Zou, Ruiping
Yu, Aibing
POWDER TECHNOLOGY, 2022, 407

← 1 2 3 4 5 →