Direct gradient-based reinforcement learning for robot behavior learning

被引：0

作者：

El-Fakdi, Andres ^{[1
]}

Carreras, Marc ^{[1
]}

Ridao, Pere ^{[1
]}

机构：

[1] Univ Girona, Inst Informat & Applicat, Polytech 4,Campus Montilivi, E-17071 Girona, Spain

来源：

INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS II | 2007年

关键词：

Robot Learning; autonomous robots;

D O I：

10.1007/978-1-4020-5626-0_21

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Autonomous Underwater Vehicles (AUV) represent a challenging control problem with complex, noisy, dynamics. Nowadays, not only the continuous scientific advances in underwater robotics but the increasing number of sub sea missions and its complexity ask for an automatization of submarine processes. This paper proposes a high-level control system for solving the action selection problem of an autonomous robot. The system is characterized by the use of Reinforcement Learning Direct Policy Search methods (RLDPS) for learning the internal state/action mapping of some behaviors. We demonstrate its feasibility with simulated experiments using the model of our underwater robot URIS in a target following task.

引用

页码：175 / +

页数：3

共 50 条

[1] Direct gradient-based reinforcement learning
Baxter, J
Bartlett, PL
[J]. ISCAS 2000: IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - PROCEEDINGS, VOL III: EMERGING TECHNOLOGIES FOR THE 21ST CENTURY, 2000, : 271 - 274
[2] Two-step gradient-based reinforcement learning for underwater robotics behavior learning
El-Fakdi, Andres
Carreras, Marc
[J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2013, 61 (03) : 271 - 282
[3] A Gradient-based reinforcement learning model of market equilibration
He, Zhongzhi
[J]. JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2023, 152
[4] Estimation and approximation bounds for gradient-based reinforcement learning
Bartlett, PL
Baxter, J
[J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2002, 64 (01) : 133 - 150
[5] Inverse Reinforcement Learning from a Gradient-based Learner
Ramponi, Giorgia
Drappo, Gianluca
Restelli, Marcello
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[6] A Gradient-Based Reinforcement Learning Algorithm for Multiple Cooperative Agents
Zhang, Zhen
Wang, Dongqing
Zhao, Dongbin
Han, Qiaoni
Song, Tingting
[J]. IEEE ACCESS, 2018, 6 : 70223 - 70235
[7] Gradient-Based Inverse Risk-Sensitive Reinforcement Learning
Mazumdar, Eric
Ratliff, Lillian J.
Fiez, Tanner
Sastry, S. Shankar
[J]. 2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
[8] Robot behavior adaptation for human-robot interaction based on policy gradient reinforcement learning
Mitsunaga, N
Smith, C
Kanda, T
Ishiguro, H
Hagita, N
[J]. 2005 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2005, : 1594 - 1601
[9] Gradient-based learning and optimization
Cao, XR
[J]. PROCEEDINGS OF THE 17TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2003, : 3 - 7
[10] Gradient-Based Minimization for Multi-Expert Inverse Reinforcement Learning
Tateo, Davide
Pirotta, Matteo
Restelli, Marcello
Bonarini, Andrea
[J]. 2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 815 - 822

← 1 2 3 4 5 →