STL-Based Synthesis of Feedback Controllers Using Reinforcement Learning

被引:0
|
作者
Singh, Nikhil Kumar [1 ]
Saha, Indranil [1 ]
机构
[1] IIT Kanpur, Dept Comp Sci & Engn, Kanpur, India
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Reinforcement Learning (DRL) has the potential to be used for synthesizing feedback controllers (agents) for various complex systems with unknown dynamics. These systems are expected to satisfy diverse safety and liveness properties best captured using temporal logic. In RL, the reward function plays a crucial role in specifying the desired behaviour of these agents. However, the problem of designing the reward function for an RL agent to satisfy complex temporal logic specifications has received limited attention in the literature. To address this, we provide a systematic way of generating rewards in real-time by using the quantitative semantics of Signal Temporal Logic (STL), a widely used temporal logic to specify the behaviour of cyber-physical systems. We propose a new quantitative semantics for STL having several desirable properties, making it suitable for reward generation. We evaluate our STL-based reinforcement learning mechanism on several complex continuous control benchmarks and compare our STL semantics with those available in the literature in terms of their efficacy in synthesizing the controller agent. Experimental results establish our new semantics to be the most suitable for synthesizing feedback controllers for complex continuous dynamical systems through reinforcement learning.
引用
收藏
页码:15118 / 15126
页数:9
相关论文
共 50 条
  • [41] Tracking Control of a Continuous Stirred Tank Reactor Using Direct and Tuned Reinforcement Learning Based Controllers
    Pandian, B. Jaganatha
    Noel, Mathew M.
    CHEMICAL PRODUCT AND PROCESS MODELING, 2018, 13 (03):
  • [42] Reinforcement Learning with Trajectory Feedback
    Efroni, Yonathan
    Merlis, Nadav
    Mannor, Shie
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7288 - 7295
  • [43] Reinforcement learning and the feedback ERN
    Holroyd, C
    PSYCHOPHYSIOLOGY, 2004, 41 : S14 - S14
  • [44] On the Search for Feedback in Reinforcement Learning
    Wang, Ran
    Parunandi, Karthikeya S.
    Sharma, Aayushman
    Goyal, Raman
    Chakravorty, Suman
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 1560 - 1567
  • [45] Reinforcement Learning with Feedback Graphs
    Dann, Christoph
    Mansour, Yishay
    Mohri, Mehryar
    Sekhari, Ayush
    Sridharan, Karthik
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [46] Deep Reinforcement Learning with Embedded LQR Controllers
    Caarls, Wouter
    IFAC PAPERSONLINE, 2020, 53 (02): : 8063 - 8069
  • [47] Safe Exploration Algorithms for Reinforcement Learning Controllers
    Mannucci, Tommaso
    van Kampen, Erik-Jan
    de Visser, Cornelis
    Chu, Qiping
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (04) : 1069 - 1081
  • [48] Due date assignment using feedback control with reinforcement learning
    Moses, SA
    IIE TRANSACTIONS, 1999, 31 (10) : 989 - 999
  • [49] Learning alternative movement coordination patterns using reinforcement feedback
    Lin, Tzu-Hsiang
    Denomme, Amber
    Ranganathan, Rajiv
    EXPERIMENTAL BRAIN RESEARCH, 2018, 236 (05) : 1395 - 1407
  • [50] Learning alternative movement coordination patterns using reinforcement feedback
    Tzu-Hsiang Lin
    Amber Denomme
    Rajiv Ranganathan
    Experimental Brain Research, 2018, 236 : 1395 - 1407