Deep reinforcement learning for cooperative robots based on adaptive sentiment feedback

被引:3
|
作者
Jeon, Haein [1 ]
Kim, Dae-Won [2 ]
Kang, Bo-Yeong [3 ]
机构
[1] Kyungpook Natl Univ, Dept Artificial Intelligence, Daegu 41566, South Korea
[2] Chung Ang Univ, Sch Comp Sci & Engn, Seoul 06974, South Korea
[3] Kyungpook Natl Univ, Dept Robot & Smart Syst Engn, Daegu 41566, South Korea
基金
新加坡国家研究基金会;
关键词
Human-robot interaction; Deep reinforcement learning; Interactive reinforcement learning; Human-in-the-loop; Reward shaping;
D O I
10.1016/j.eswa.2023.121198
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human-robot cooperative tasks have gained importance with the emergence of robotics and artificial intelligence technology. In interactive reinforcement learning techniques, robots learn target tasks by receiving feedback from an experienced human trainer. However, most interactive reinforcement learning studies require a separate process to integrate the trainer's feedback into the training dataset, making it challenging for robots to learn new tasks from humans in real-time. Furthermore, the types of feedback sentences that trainers can use are limited in previous research. To address these limitations, this paper proposes a robot teaching strategy that uses deep RL via human-robot interaction to learn table balancing tasks interactively. The proposed system employs Deep Q-Network with real-time sentiment feedback delivered through the trainer's speech to learn cooperative tasks. We designed a novel reward function that incorporates sentiment feedback from human speech in real-time during the learning process. The paper presents an improved reward shaping technique based on subdivided feedback levels and shrinking feedback. This function serves as a guide for the robot to engage in natural interactions with humans and enables it to learn the tasks effectively. Experimental results demonstrate that the proposed interactive deep reinforcement learning model achieved a high success rate of up to 99.06%, outperforming the model without sentiment feedback.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Navigating Robots in Dynamic Environment With Deep Reinforcement Learning
    Zhou, Zhiqian
    Zeng, Zhiwen
    Lang, Lin
    Yao, Weijia
    Lu, Huimin
    Zheng, Zhiqiang
    Zhou, Zongtan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 25201 - 25211
  • [42] Deep reinforcement learning for shared control of mobile robots
    Tian, Chong
    Shaik, Shahil
    Wang, Yue
    IET CYBER-SYSTEMS AND ROBOTICS, 2021, 3 (04) : 315 - 330
  • [43] On Training Flexible Robots using Deep Reinforcement Learning
    Dwiel, Zach
    Candadai, Madhavun
    Phielipp, Mariano
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 4666 - 4671
  • [44] Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning
    Das, Abhishek
    Kottur, Satwik
    Moura, Jose M. F.
    Lee, Stefan
    Batra, Dhruv
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2970 - 2979
  • [45] Toward an Adaptive Threshold on Cooperative Bandwidth Management Based on Hierarchical Reinforcement Learning
    Mobasheri, Motahareh
    Kim, Yangwoo
    Kim, Woongsup
    SENSORS, 2021, 21 (21)
  • [46] Adaptive Trust Threshold Model Based on Reinforcement Learning in Cooperative Spectrum Sensing
    Xie, Gang
    Zhou, Xincheng
    Gao, Jinchun
    SENSORS, 2023, 23 (10)
  • [47] Blockchain-Integrated Multiagent Deep Reinforcement Learning for Securing Cooperative Adaptive Cruise Control
    Raja, Gunasekaran
    Kottursamy, Kottilingam
    Dev, Kapal
    Narayanan, Renuka
    Raja, Ashmitha
    Karthik, K. Bhavani Venkata
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 9630 - 9639
  • [48] Adaptive Cooperative Distributed Compressed Sensing for Edge Devices: A Multiagent Deep Reinforcement Learning Approach
    Sekine, Masatoshi
    Ikada, Satoshi
    2021 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS AND OTHER AFFILIATED EVENTS (PERCOM WORKSHOPS), 2021, : 585 - 591
  • [49] Cooperative Adaptive Cruise Control: A Reinforcement Learning Approach
    Desjardins, Charles
    Chaib-draa, Brahim
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2011, 12 (04) : 1248 - 1260
  • [50] Adaptive and Personalised Robots - Learning from Users' Feedback
    Karami, Abir B.
    Sehaba, Karim
    Encelle, Benoit
    2013 IEEE 25TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2013, : 626 - 632