Deep reinforcement learning for cooperative robots based on adaptive sentiment feedback

被引:3
|
作者
Jeon, Haein [1 ]
Kim, Dae-Won [2 ]
Kang, Bo-Yeong [3 ]
机构
[1] Kyungpook Natl Univ, Dept Artificial Intelligence, Daegu 41566, South Korea
[2] Chung Ang Univ, Sch Comp Sci & Engn, Seoul 06974, South Korea
[3] Kyungpook Natl Univ, Dept Robot & Smart Syst Engn, Daegu 41566, South Korea
基金
新加坡国家研究基金会;
关键词
Human-robot interaction; Deep reinforcement learning; Interactive reinforcement learning; Human-in-the-loop; Reward shaping;
D O I
10.1016/j.eswa.2023.121198
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human-robot cooperative tasks have gained importance with the emergence of robotics and artificial intelligence technology. In interactive reinforcement learning techniques, robots learn target tasks by receiving feedback from an experienced human trainer. However, most interactive reinforcement learning studies require a separate process to integrate the trainer's feedback into the training dataset, making it challenging for robots to learn new tasks from humans in real-time. Furthermore, the types of feedback sentences that trainers can use are limited in previous research. To address these limitations, this paper proposes a robot teaching strategy that uses deep RL via human-robot interaction to learn table balancing tasks interactively. The proposed system employs Deep Q-Network with real-time sentiment feedback delivered through the trainer's speech to learn cooperative tasks. We designed a novel reward function that incorporates sentiment feedback from human speech in real-time during the learning process. The paper presents an improved reward shaping technique based on subdivided feedback levels and shrinking feedback. This function serves as a guide for the robot to engage in natural interactions with humans and enables it to learn the tasks effectively. Experimental results demonstrate that the proposed interactive deep reinforcement learning model achieved a high success rate of up to 99.06%, outperforming the model without sentiment feedback.
引用
下载
收藏
页数:11
相关论文
共 50 条
  • [1] Deep Reinforcement Learning based Indoor Air Quality Sensing by Cooperative Mobile Robots
    Hu, Zhiwen
    Song, Tiankuo
    Biant, Kaigui
    Song, Lingyang
    2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2020,
  • [2] Deep Reinforcement Learning for the Autonomous Adaptive Behavior of Social Robots
    Maroto-Gomez, Marcos
    Malfaz, Maria
    Castro-Gonzalez, Alvaro
    Angel Salichs, Miguel
    SOCIAL ROBOTICS, ICSR 2022, PT I, 2022, 13817 : 208 - 217
  • [3] Adaptive Actuation of Magnetic Soft Robots Using Deep Reinforcement Learning
    Yao, Jianpeng
    Cao, Quanliang
    Ju, Yuwei
    Sun, Yuxuan
    Liu, Ruiqi
    Han, Xiaotao
    Li, Liang
    ADVANCED INTELLIGENT SYSTEMS, 2023, 5 (02)
  • [4] Deep Reinforcement Learning for Path Planning by Cooperative Robots: Existing Approaches and Challenges
    Othman, Walaa
    Shilov, Nikolay
    PROCEEDINGS OF THE 28TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION FRUCT, 2021, : 350 - 357
  • [5] Deep Reinforcement Learning-Based Adaptive IRS Control with Limited Feedback Codebooks
    Kim, Junghoon
    Hosseinalipour, Seyyedali
    Marcum, Andrew C.
    Kim, Taejoon
    Love, David J.
    Brinton, Christopher G.
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 5171 - 5177
  • [6] Sentiment and Knowledge Based Algorithmic Trading with Deep Reinforcement Learning
    Nan, Abhishek
    Perumal, Anandh
    Zaiane, Osmar R.
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2022, PT I, 2022, 13426 : 167 - 180
  • [7] Adaptive beamforming based on the deep reinforcement learning
    Hao, Chuanhui
    Sun, Xubao
    Liu, Yidong
    ICNSC 2022 - Proceedings of 2022 IEEE International Conference on Networking, Sensing and Control: Autonomous Intelligent Systems, 2022,
  • [8] Cooperative Proactive Eavesdropping Based on Deep Reinforcement Learning
    Yang, Yaxin
    Li, Baogang
    Zhang, Shue
    Zhao, Wei
    Zhang, Haijun
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2021, 10 (09) : 1857 - 1861
  • [9] Altruistic cooperative adaptive cruise control of mixed traffic platoon based on deep reinforcement learning
    Lu, Sikai
    Cai, Yingfeng
    Chen, Long
    Wang, Hai
    Sun, Xiaoqiang
    Gao, Hongbo
    IET INTELLIGENT TRANSPORT SYSTEMS, 2023, 17 (10) : 1951 - 1963
  • [10] Sentiment Analysis and Deep Learning Based Chatbot for User Feedback
    Nivethan
    Sankar, Sriram
    INTELLIGENT COMMUNICATION TECHNOLOGIES AND VIRTUAL MOBILE NETWORKS, ICICV 2019, 2020, 33 : 231 - 237