STL-Based Synthesis of Feedback Controllers Using Reinforcement Learning

被引:0
|
作者
Singh, Nikhil Kumar [1 ]
Saha, Indranil [1 ]
机构
[1] IIT Kanpur, Dept Comp Sci & Engn, Kanpur, India
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Reinforcement Learning (DRL) has the potential to be used for synthesizing feedback controllers (agents) for various complex systems with unknown dynamics. These systems are expected to satisfy diverse safety and liveness properties best captured using temporal logic. In RL, the reward function plays a crucial role in specifying the desired behaviour of these agents. However, the problem of designing the reward function for an RL agent to satisfy complex temporal logic specifications has received limited attention in the literature. To address this, we provide a systematic way of generating rewards in real-time by using the quantitative semantics of Signal Temporal Logic (STL), a widely used temporal logic to specify the behaviour of cyber-physical systems. We propose a new quantitative semantics for STL having several desirable properties, making it suitable for reward generation. We evaluate our STL-based reinforcement learning mechanism on several complex continuous control benchmarks and compare our STL semantics with those available in the literature in terms of their efficacy in synthesizing the controller agent. Experimental results establish our new semantics to be the most suitable for synthesizing feedback controllers for complex continuous dynamical systems through reinforcement learning.
引用
收藏
页码:15118 / 15126
页数:9
相关论文
共 50 条
  • [31] Tuning path tracking controllers for autonomous cars using reinforcement learning
    Carrasco, Ana Vilaca
    Sequeira, Joao Silva
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [32] Tuning path tracking controllers for autonomous cars using reinforcement learning
    Carrasco A.V.
    Sequeira J.S.
    PeerJ Computer Science, 2023, 9
  • [33] Automated design of adaptive controllers for modular robots using reinforcement learning
    Varshavskaya, Paulina
    Kaelbling, Leslie Pack
    Rus, Daniela
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2008, 27 (3-4): : 505 - 526
  • [34] KINEMATIC SYNTHESIS USING REINFORCEMENT LEARNING
    Vermeer, Kaz
    Kuppens, Reinier
    Herder, Justus
    PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2018, VOL 2A, 2018,
  • [35] Learning a Swarm Foraging Behavior with Microscopic Fuzzy Controllers Using Deep Reinforcement Learning
    Aznar, Fidel
    Pujol, Mar
    Rizo, Ramon
    APPLIED SCIENCES-BASEL, 2021, 11 (06):
  • [36] Reinforcement Learning Based Neural Controllers for Dynamic Processes without Exploration
    Steege, Frank-Florian
    Hartmann, Andre
    Schaffernicht, Erik
    Gross, Horst-Michael
    ARTIFICIAL NEURAL NETWORKS-ICANN 2010, PT II, 2010, 6353 : 222 - +
  • [37] Learning to Tune a Class of Controllers with Deep Reinforcement Learning
    Shipman, William John
    MINERALS, 2021, 11 (09)
  • [38] Learning to coordinate controllers - Reinforcement learning on a control basis
    Huber, M
    Grupen, RA
    IJCAI-97 - PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, 1997, : 1366 - 1371
  • [39] Robotic Arm Representation Using Image-Based Feedback for Deep Reinforcement Learning
    Al-Zabt, Abdullah
    Tutunji, Tarek A.
    2019 IEEE JORDAN INTERNATIONAL JOINT CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATION TECHNOLOGY (JEEIT), 2019, : 168 - 173
  • [40] Feedback for reinforcement learning based brain-machine interfaces using confidence metrics
    Prins, Noeline W.
    Sanchez, Justin C.
    Prasad, Abhishek
    JOURNAL OF NEURAL ENGINEERING, 2017, 14 (03)