Direct gradient-based reinforcement learning for robot behavior learning

Cited by: 0
Authors
El-Fakdi, Andres [1]
Carreras, Marc [1]
Ridao, Pere [1]
Affiliations
[1] Univ Girona, Inst Informat & Applicat, Polytech 4,Campus Montilivi, E-17071 Girona, Spain
Keywords
Robot Learning; autonomous robots;
DOI
10.1007/978-1-4020-5626-0_21
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Autonomous Underwater Vehicles (AUVs) pose a challenging control problem because of their complex, noisy dynamics. Both the continuing scientific advances in underwater robotics and the growing number and complexity of subsea missions call for the automation of submarine processes. This paper proposes a high-level control system for solving the action selection problem of an autonomous robot. The system uses Reinforcement Learning Direct Policy Search (RLDPS) methods to learn the internal state/action mapping of some behaviors. We demonstrate its feasibility in simulated experiments that use the model of our underwater robot URIS in a target-following task.
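To make the idea of direct policy search concrete, here is a minimal sketch of a REINFORCE-style policy-gradient learner on a toy one-dimensional target-following task. It is not the authors' algorithm and does not use the URIS model: the environment (a robot on an integer line with the target at 0), the reward (negative normalized distance), the three-valued state, and every name in the code (`train`, `greedy_action`, `ACTIONS`, and so on) are assumptions made only for illustration.

```python
import math
import random

ACTIONS = (-1, 0, 1)  # hypothetical actions: move left, hold, move right
STATES = (-1, 0, 1)   # hypothetical state: sign of (target - position)


def sign(v):
    """Return -1, 0 or 1 according to the sign of v."""
    return (v > 0) - (v < 0)


def softmax(prefs):
    """Turn action preferences into a probability distribution."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def train(episodes=4000, horizon=10, lr=0.05, gamma=0.9, seed=0):
    """Learn a tabular softmax policy by following the policy gradient."""
    rng = random.Random(seed)
    theta = {s: [0.0, 0.0, 0.0] for s in STATES}  # policy parameters
    baseline = {s: 0.0 for s in STATES}           # running mean return per state
    for _ in range(episodes):
        x, target = rng.randint(-5, 5), 0
        trajectory = []
        for _ in range(horizon):
            s = sign(target - x)
            probs = softmax(theta[s])
            a = rng.choices(range(3), weights=probs)[0]
            x = max(-5, min(5, x + ACTIONS[a]))   # toy dynamics, assumed
            r = -abs(target - x) / 5.0            # toy reward, assumed
            trajectory.append((s, a, probs, r))
        g = 0.0
        # Walk the episode backwards to accumulate the discounted reward-to-go.
        for s, a, probs, r in reversed(trajectory):
            g = r + gamma * g
            advantage = g - baseline[s]
            baseline[s] += 0.05 * advantage
            for i in range(3):
                # Gradient of log softmax: indicator of the chosen action minus probability.
                grad_log = (1.0 if i == a else 0.0) - probs[i]
                theta[s][i] += lr * advantage * grad_log
    return theta


def greedy_action(theta, s):
    """Return the most probable action in state s under the learned policy."""
    prefs = theta[s]
    return ACTIONS[prefs.index(max(prefs))]


if __name__ == "__main__":
    learned = train()
    for s in STATES:
        print("state", s, "-> action", greedy_action(learned, s))
```

The policy parameters are adjusted directly along an estimate of the gradient of expected return, without learning a value function for action selection; the per-state baseline is used only to reduce the variance of that estimate.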
Pages: 175 / +
Page count: 3
Related papers
50 in total
  • [1] Direct gradient-based reinforcement learning
    Baxter, J
    Bartlett, PL
    [J]. ISCAS 2000: IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - PROCEEDINGS, VOL III: EMERGING TECHNOLOGIES FOR THE 21ST CENTURY, 2000, : 271 - 274
  • [2] Two-step gradient-based reinforcement learning for underwater robotics behavior learning
    El-Fakdi, Andres
    Carreras, Marc
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2013, 61 (03) : 271 - 282
  • [3] A Gradient-based reinforcement learning model of market equilibration
    He, Zhongzhi
    [J]. JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2023, 152
  • [4] Estimation and approximation bounds for gradient-based reinforcement learning
    Bartlett, PL
    Baxter, J
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2002, 64 (01) : 133 - 150
  • [5] Inverse Reinforcement Learning from a Gradient-based Learner
    Ramponi, Giorgia
    Drappo, Gianluca
    Restelli, Marcello
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [6] A Gradient-Based Reinforcement Learning Algorithm for Multiple Cooperative Agents
    Zhang, Zhen
    Wang, Dongqing
    Zhao, Dongbin
    Han, Qiaoni
    Song, Tingting
    [J]. IEEE ACCESS, 2018, 6 : 70223 - 70235
  • [7] Gradient-Based Inverse Risk-Sensitive Reinforcement Learning
    Mazumdar, Eric
    Ratliff, Lillian J.
    Fiez, Tanner
    Sastry, S. Shankar
    [J]. 2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
  • [8] Robot behavior adaptation for human-robot interaction based on policy gradient reinforcement learning
    Mitsunaga, N
    Smith, C
    Kanda, T
    Ishiguro, H
    Hagita, N
    [J]. 2005 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2005, : 1594 - 1601
  • [9] Gradient-based learning and optimization
    Cao, XR
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2003, : 3 - 7
  • [10] Gradient-Based Minimization for Multi-Expert Inverse Reinforcement Learning
    Tateo, Davide
    Pirotta, Matteo
    Restelli, Marcello
    Bonarini, Andrea
    [J]. 2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 815 - 822