Variational meta reinforcement learning for social robotics

被引:2
|
作者
Ballou, Anand [1 ]
Alameda-Pineda, Xavier [1 ]
Reinke, Chris [1 ]
机构
[1] Univ Grenoble Alpes, Grenoble INP, CNRS, RobotLearn,Inria,LJK, 655 Ave Europe, F-38334 Montbonnot St Martin, France
基金
欧盟地平线“2020”;
关键词
Social robotics; Meta-learning; Deep reinforcement learning; Radial-basis-functions; VISUAL NAVIGATION; ENVIRONMENTS; NETWORKS;
D O I
10.1007/s10489-023-04691-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the increasing presence of robots in our everyday environments, improving their social skills is of utmost importance. Nonetheless, social robotics still faces many challenges. One bottleneck is that robotic behaviors often need to be adapted, as social norms depend strongly on the environment. For example, a robot should navigate more carefully around patients in a hospital than around workers in an office. In this work, we investigate meta-reinforcement learning (meta-RL) as a potential solution. Here, robot behaviors are learned via reinforcement learning, where a reward function needs to be chosen so that the robot learns an appropriate behavior for a given environment. We propose to use a variational meta-RL procedure that quickly adapts the robots' behavior to new reward functions. As a result, in a new environment different reward functions can be quickly evaluated and an appropriate one selected. The procedure learns a vectorized representation for reward functions and a meta-policy that can be conditioned to such a representation. Given observations from a new reward function, the procedure identifies its representation and conditions the meta-policy to it. While investigating the procedure's capabilities, we realized that it suffers from posterior collapse where only a subset of the dimensions in the representation encode useful information, resulting in reduced performance. Our second contribution, a radial basis function (RBF) layer, partially mitigates this negative effect. The RBF layer lifts the representation to a higher dimensional space, which is more easily exploitable for the meta-policy. We demonstrate the interest of the RBF layer and the usage of meta-RL for social robotics in four robotic simulation tasks.
引用
收藏
页码:27249 / 27268
页数:20
相关论文
共 50 条
  • [31] Robotics in construction: A critical review of the reinforcement learning and imitation learning paradigms
    Delgado, Juan Manuel Davila
    Oyedele, Lukumon
    [J]. ADVANCED ENGINEERING INFORMATICS, 2022, 54
  • [32] Variational Policy Chaining for Lifelong Reinforcement Learning
    Doyle, Christopher
    Gueriau, Maxime
    Dusparic, Ivana
    [J]. 2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1546 - 1550
  • [33] Variational Quantum Circuits for Deep Reinforcement Learning
    Chen, Samuel Yen-Chi
    Yang, Chao-Han Huck
    Qi, Jun
    Chen, Pin-Yu
    Ma, Xiaoli
    Goan, Hsi-Sheng
    [J]. IEEE ACCESS, 2020, 8 : 141007 - 141024
  • [34] Deep Reinforcement Learning for Tactile Robotics: Learning to Type on a Braille Keyboard
    Church, Alex
    Lloyd, John
    Hadsell, Raia
    Lepora, Nathan F.
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04): : 6145 - 6152
  • [35] Social robotics-Trust, learning, and social interaction
    Yang, Guang-Zhong
    Dario, Paolo
    Kragic, Danica
    [J]. SCIENCE ROBOTICS, 2018, 3 (21)
  • [36] Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning
    Brunke, Lukas
    Greeff, Melissa
    Hall, Adam W.
    Yuan, Zhaocong
    Zhou, Siqi
    Panerati, Jacopo
    Schoellig, Angela P.
    [J]. ANNUAL REVIEW OF CONTROL ROBOTICS AND AUTONOMOUS SYSTEMS, 2022, 5 : 411 - 444
  • [37] Robust and efficient task scheduling for robotics applications with reinforcement learning
    Tejer, Mateusz
    Szczepanski, Rafal
    Tarczewski, Tomasz
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
  • [38] Toward Faster Reinforcement Learning for Robotics: Using Gaussian Processes
    Younes, Ali
    Panov, Aleksandr I.
    [J]. ARTIFICIAL INTELLIGENCE, 2019, 11866 : 160 - 174
  • [39] Computational Optimization of Image-Based Reinforcement Learning for Robotics
    Ferraro, Stefano
    Van de Maele, Toon
    Mazzaglia, Pietro
    Verbelen, Tim
    Dhoedt, Bart
    [J]. SENSORS, 2022, 22 (19)
  • [40] RL-Studio: A Tool for Reinforcement Learning Methods in Robotics
    Fernandez de Cabo, Pedro
    Lucas, Ruben
    Arranz, Ignacio
    Paniego, Sergio
    Canas, Jose M.
    [J]. ROBOT2022: FIFTH IBERIAN ROBOTICS CONFERENCE: ADVANCES IN ROBOTICS, VOL 2, 2023, 590 : 502 - 513