Deep reinforcement learning for cooperative robots based on adaptive sentiment feedback

被引:3
|
作者
Jeon, Haein [1 ]
Kim, Dae-Won [2 ]
Kang, Bo-Yeong [3 ]
机构
[1] Kyungpook Natl Univ, Dept Artificial Intelligence, Daegu 41566, South Korea
[2] Chung Ang Univ, Sch Comp Sci & Engn, Seoul 06974, South Korea
[3] Kyungpook Natl Univ, Dept Robot & Smart Syst Engn, Daegu 41566, South Korea
基金
新加坡国家研究基金会;
关键词
Human-robot interaction; Deep reinforcement learning; Interactive reinforcement learning; Human-in-the-loop; Reward shaping;
D O I
10.1016/j.eswa.2023.121198
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human-robot cooperative tasks have gained importance with the emergence of robotics and artificial intelligence technology. In interactive reinforcement learning techniques, robots learn target tasks by receiving feedback from an experienced human trainer. However, most interactive reinforcement learning studies require a separate process to integrate the trainer's feedback into the training dataset, making it challenging for robots to learn new tasks from humans in real-time. Furthermore, the types of feedback sentences that trainers can use are limited in previous research. To address these limitations, this paper proposes a robot teaching strategy that uses deep RL via human-robot interaction to learn table balancing tasks interactively. The proposed system employs Deep Q-Network with real-time sentiment feedback delivered through the trainer's speech to learn cooperative tasks. We designed a novel reward function that incorporates sentiment feedback from human speech in real-time during the learning process. The paper presents an improved reward shaping technique based on subdivided feedback levels and shrinking feedback. This function serves as a guide for the robot to engage in natural interactions with humans and enables it to learn the tasks effectively. Experimental results demonstrate that the proposed interactive deep reinforcement learning model achieved a high success rate of up to 99.06%, outperforming the model without sentiment feedback.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] An Improved Reinforcement Learning Algorithm for Cooperative Behaviors of Mobile Robots
    Song, Yong
    Li, Yibin
    Wang, Xiaoli
    Ma, Xin
    Ruan, Jiuhong
    JOURNAL OF CONTROL SCIENCE AND ENGINEERING, 2014, 2014
  • [32] A realization of socially adaptive robots by competitive reinforcement learning
    Nakayama, T
    Mikami, S
    Wada, M
    RO-MAN '96 - 5TH IEEE INTERNATIONAL WORKSHOP ON ROBOT AND HUMAN COMMUNICATION, PROCEEDINGS, 1996, : 107 - 111
  • [33] An active pursuit strategy for autonomous mobile robots based on deep reinforcement learning
    Gao, Yingnan
    Cheng, Lei
    He, Yu
    Wang, Duanchu
    2022 IEEE INTERNATIONAL CONFERENCE ON CYBORG AND BIONIC SYSTEMS, CBS, 2022, : 326 - 331
  • [34] Coverage path planning for kiwifruit picking robots based on deep reinforcement learning
    Wang, Yinchu
    He, Zhi
    Cao, Dandan
    Ma, Li
    Li, Kai
    Jia, Liangsheng
    Cui, Yongjie
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 205
  • [35] Feedback Control For Cassie With Deep Reinforcement Learning
    Xie, Zhaoming
    Berseth, Glen
    Clary, Patrick
    Hurst, Jonathan
    van de Panne, Michiel
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 1241 - 1246
  • [36] Decision Making of Ball-Batting Robots Based on Deep Reinforcement Learning
    Hsiao, Tesheng
    Kao, Hsuan-Che
    2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 782 - 787
  • [37] Deep Reinforcement Learning-Based Control of Bicycle Robots on Rough Terrain
    Zhu, Xianjin
    Zheng, Xudong
    Deng, Yang
    Chen, Zhang
    Liang, Bin
    Liu, Yu
    2023 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS, ICCAR, 2023, : 103 - 108
  • [38] Deep Reinforcement Learning Autoencoder with Noisy Feedback
    Goutay, Mathieu
    Aoudia, Faycal Ait
    Hoydis, Jakob
    17TH INTERNATIONAL SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC, AND WIRELESS NETWORKS (WIOPT 2019), 2019, : 346 - 351
  • [39] Deep reinforcement learning for motion planning of mobile robots
    Sun H.-H.
    Hu C.-H.
    Zhang J.-G.
    Kongzhi yu Juece/Control and Decision, 2021, 36 (06): : 1281 - 1292
  • [40] Deep Reinforcement Learning for Conversational Robots Playing Games
    Cuayahuitl, Heriberto
    2017 IEEE-RAS 17TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTICS (HUMANOIDS), 2017, : 771 - 776