Deep reinforcement learning for cooperative robots based on adaptive sentiment feedback

被引:3
|
作者
Jeon, Haein [1 ]
Kim, Dae-Won [2 ]
Kang, Bo-Yeong [3 ]
机构
[1] Kyungpook Natl Univ, Dept Artificial Intelligence, Daegu 41566, South Korea
[2] Chung Ang Univ, Sch Comp Sci & Engn, Seoul 06974, South Korea
[3] Kyungpook Natl Univ, Dept Robot & Smart Syst Engn, Daegu 41566, South Korea
基金
新加坡国家研究基金会;
关键词
Human-robot interaction; Deep reinforcement learning; Interactive reinforcement learning; Human-in-the-loop; Reward shaping;
D O I
10.1016/j.eswa.2023.121198
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human-robot cooperative tasks have gained importance with the emergence of robotics and artificial intelligence technology. In interactive reinforcement learning techniques, robots learn target tasks by receiving feedback from an experienced human trainer. However, most interactive reinforcement learning studies require a separate process to integrate the trainer's feedback into the training dataset, making it challenging for robots to learn new tasks from humans in real-time. Furthermore, the types of feedback sentences that trainers can use are limited in previous research. To address these limitations, this paper proposes a robot teaching strategy that uses deep RL via human-robot interaction to learn table balancing tasks interactively. The proposed system employs Deep Q-Network with real-time sentiment feedback delivered through the trainer's speech to learn cooperative tasks. We designed a novel reward function that incorporates sentiment feedback from human speech in real-time during the learning process. The paper presents an improved reward shaping technique based on subdivided feedback levels and shrinking feedback. This function serves as a guide for the robot to engage in natural interactions with humans and enables it to learn the tasks effectively. Experimental results demonstrate that the proposed interactive deep reinforcement learning model achieved a high success rate of up to 99.06%, outperforming the model without sentiment feedback.
引用
收藏
页数:11
相关论文
共 50 条
  • [11] Deep Reinforcement Learning with Feedback-based Exploration
    Scholten, Jan
    Wout, Daan
    Celemin, Carlos
    Kober, Jens
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 803 - 808
  • [12] Mapless navigation based on deep reinforcement learning for mobile robots
    Hu G.-M.
    Cai K.-W.
    Wang F.
    Kang Y.-W.
    Zhang J.-X.
    Jin Z.
    Lin Y.-S.
    Kongzhi yu Juece/Control and Decision, 2024, 39 (03): : 985 - 993
  • [13] Motion Coordination of Multiple Robots Based on Deep Reinforcement Learning
    Hao, Xiuzhao
    Wu, Zhihao
    Zhou, Haiguang
    Bai, Xiangpeng
    Lin, Youfang
    Han, Sheng
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 955 - 962
  • [14] An adaptive deep reinforcement learning approach for MIMO PID control of mobile robots
    Carlucho, Ignacio
    De Paula, Mariano
    Acosta, Gerardo G.
    ISA TRANSACTIONS, 2020, 102 : 280 - 294
  • [15] A Stock Prediction Method Based on Deep Reinforcement Learning and Sentiment Analysis
    Du, Sha
    Shen, Hailong
    Applied Sciences (Switzerland), 2024, 14 (19):
  • [16] Incremental Learning for Autonomous Navigation of Mobile Robots based on Deep Reinforcement Learning
    Manh Luong
    Cuong Pham
    Journal of Intelligent & Robotic Systems, 2021, 101
  • [17] Incremental Learning for Autonomous Navigation of Mobile Robots based on Deep Reinforcement Learning
    Manh Luong
    Cuong Pham
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 101 (01)
  • [18] A reinforcement learning adaptive fuzzy controller for robots
    Lin, CK
    FUZZY SETS AND SYSTEMS, 2003, 137 (03) : 339 - 352
  • [19] Online parameter adaptive control of mobile robots based on deep reinforcement learning under multiple optimisation objectives
    Sui, Xiuli
    Chen, Haiyong
    COGNITIVE COMPUTATION AND SYSTEMS, 2024, : 86 - 97
  • [20] Sentiment-influenced trading system based on multimodal deep reinforcement learning
    Chen, Yu-Fu
    Huang, Szu-Hao
    APPLIED SOFT COMPUTING, 2021, 112