Discrete fully probabilistic design: towards a control pipeline for the synthesis of policies from examples

被引:0
|
作者
Ferrentino, Enrico [1 ]
Chiacchio, Pasquale [1 ]
Russo, Giovanni [1 ]
机构
[1] Univ Salerno, Dept Comp & Elect Engn & Appl Math DIEM, I-84084 Fisciano, SA, Italy
关键词
MARKOV DECISION-PROCESSES;
D O I
10.1109/MED59994.2023.10185706
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present the principled design of a control pipeline for the synthesis of policies from examples data. The pipeline, based on a discretized design, expounds the algorithm introduced in [1] to synthesize policies from examples for constrained, stochastic and nonlinear systems. The pipeline: (i) does not need the constraints to be fulfilled in the possibly noisy example data; (ii) enables control synthesis even when the data are collected from an example system that is different from the one under control. The design is benchmarked on an example that involves controlling an inverted pendulum with actuation constraints. The data that are used to synthesize the policy are collected from a pendulum that: (i) is different from the one under control; (ii) does not satisfy the actuation constraints.
引用
收藏
页码:759 / 764
页数:6
相关论文
共 50 条
  • [1] Towards fully probabilistic control design
    Karny, M
    AUTOMATICA, 1996, 32 (12) : 1719 - 1722
  • [2] Control synthesis using the Fully Probabilistic Design
    Lebeda, Ales
    Leth, John
    2016 IEEE CONFERENCE ON COMPUTER AIDED CONTROL SYSTEM DESIGN (CACSD), 2016, : 1458 - 1463
  • [3] Fully probabilistic control design
    Kárny, M
    Guy, TV
    SYSTEMS & CONTROL LETTERS, 2006, 55 (04) : 259 - 265
  • [4] Fully Probabilistic Design for Stochastic Discrete System with Multiplicative Noise
    Zhou, Yuyang
    Herzallah, Randa
    Zafar, Ana
    2019 IEEE 15TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2019, : 940 - 945
  • [5] Balancing Exploitation and Exploration via Fully Probabilistic Design of Decision Policies
    Karny, Miroslav
    Hula, Frantisek
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 857 - 864
  • [6] Fully probabilistic control design in an adaptive critic framework
    Herzallah, Randa
    Karny, Miroslav
    NEURAL NETWORKS, 2011, 24 (10) : 1128 - 1135
  • [7] A Fully Probabilistic Decentralised Control Design for Complex Stochastic Systems
    Herzallah, Randa
    2019 IEEE 15TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2019, : 806 - 811
  • [8] Towards probabilistic intrusion detection in supervisory control of discrete event systems
    Meira-Goes, Romulo
    Keroglou, Christoforos
    Lafortune, Stephane
    IFAC PAPERSONLINE, 2020, 53 (02): : 1776 - 1782
  • [9] A Fully Probabilistic Design for Tracking Control for Stochastic Systems With Input Delay
    Herzallah, Randa
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (09) : 4342 - 4348
  • [10] The synthesis of safe control policies in decentralized control of discrete-event systems
    Rohloff, K
    Lafortune, S
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2003, 48 (06) : 1064 - 1068