STL-Based Synthesis of Feedback Controllers Using Reinforcement Learning

被引：0

作者：

Singh, Nikhil Kumar ^{[1
]}

Saha, Indranil ^{[1
]}

机构：

[1] IIT Kanpur, Dept Comp Sci & Engn, Kanpur, India

来源：

THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12 | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep Reinforcement Learning (DRL) has the potential to be used for synthesizing feedback controllers (agents) for various complex systems with unknown dynamics. These systems are expected to satisfy diverse safety and liveness properties best captured using temporal logic. In RL, the reward function plays a crucial role in specifying the desired behaviour of these agents. However, the problem of designing the reward function for an RL agent to satisfy complex temporal logic specifications has received limited attention in the literature. To address this, we provide a systematic way of generating rewards in real-time by using the quantitative semantics of Signal Temporal Logic (STL), a widely used temporal logic to specify the behaviour of cyber-physical systems. We propose a new quantitative semantics for STL having several desirable properties, making it suitable for reward generation. We evaluate our STL-based reinforcement learning mechanism on several complex continuous control benchmarks and compare our STL semantics with those available in the literature in terms of their efficacy in synthesizing the controller agent. Experimental results establish our new semantics to be the most suitable for synthesizing feedback controllers for complex continuous dynamical systems through reinforcement learning.

引用

页码：15118 / 15126

页数：9

共 50 条

[31] Tuning path tracking controllers for autonomous cars using reinforcement learning
Carrasco, Ana Vilaca
Sequeira, Joao Silva
PEERJ COMPUTER SCIENCE, 2023, 9
[32] Tuning path tracking controllers for autonomous cars using reinforcement learning
Carrasco A.V.
Sequeira J.S.
PeerJ Computer Science, 2023, 9
[33] Automated design of adaptive controllers for modular robots using reinforcement learning
Varshavskaya, Paulina
Kaelbling, Leslie Pack
Rus, Daniela
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2008, 27 (3-4): : 505 - 526
[34] KINEMATIC SYNTHESIS USING REINFORCEMENT LEARNING
Vermeer, Kaz
Kuppens, Reinier
Herder, Justus
PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2018, VOL 2A, 2018,
[35] Learning a Swarm Foraging Behavior with Microscopic Fuzzy Controllers Using Deep Reinforcement Learning
Aznar, Fidel
Pujol, Mar
Rizo, Ramon
APPLIED SCIENCES-BASEL, 2021, 11 (06):
[36] Reinforcement Learning Based Neural Controllers for Dynamic Processes without Exploration
Steege, Frank-Florian
Hartmann, Andre
Schaffernicht, Erik
Gross, Horst-Michael
ARTIFICIAL NEURAL NETWORKS-ICANN 2010, PT II, 2010, 6353 : 222 - +
[37] Learning to Tune a Class of Controllers with Deep Reinforcement Learning
Shipman, William John
MINERALS, 2021, 11 (09)
[38] Learning to coordinate controllers - Reinforcement learning on a control basis
Huber, M
Grupen, RA
IJCAI-97 - PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, 1997, : 1366 - 1371
[39] Robotic Arm Representation Using Image-Based Feedback for Deep Reinforcement Learning
Al-Zabt, Abdullah
Tutunji, Tarek A.
2019 IEEE JORDAN INTERNATIONAL JOINT CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATION TECHNOLOGY (JEEIT), 2019, : 168 - 173
[40] Feedback for reinforcement learning based brain-machine interfaces using confidence metrics
Prins, Noeline W.
Sanchez, Justin C.
Prasad, Abhishek
JOURNAL OF NEURAL ENGINEERING, 2017, 14 (03)

← 1 2 3 4 5 →