On the Robustness of Speech Emotion Recognition for Human-Robot Interaction with Deep Neural Networks

被引:0
|
作者
Lakomkin, Egor [1 ]
Zamani, Mohammad Ali [1 ]
Weber, Cornelius [1 ]
Magg, Sven [1 ]
Wermter, Stefan [1 ]
机构
[1] Univ Hamburg, Dept Informat, Knowledge Technol Inst, Vogt Koelln Str 30, D-22527 Hamburg, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech emotion recognition (SER) is an important aspect of effective human-robot collaboration and received a lot of attention from the research community. For example, many neural network-based architectures were proposed recently and pushed the performance to a new level. However, the applicability of such neural SER models trained only on in-domain data to noisy conditions is currently under-researched. In this work, we evaluate the robustness of state-of-the-art neural acoustic emotion recognition models in human-robot interaction scenarios. We hypothesize that a robot's ego noise, room conditions, and various acoustic events that can occur in a home environment can significantly affect the performance of a model. We conduct several experiments on the iCub robot platform and propose several novel ways to reduce the gap between the model's performance during training and testing in real-world conditions. Furthermore, we observe large improvements in the model performance on the robot and demonstrate the necessity of introducing several data augmentation techniques like overlaying background noise and loudness variations to improve the robustness of the neural approaches.
引用
收藏
页码:854 / 860
页数:7
相关论文
共 50 条
  • [1] Emotion Recognition From Speech to Improve Human-robot Interaction
    Zhu, Changrui
    Ahamd, Wasim
    [J]. IEEE 17TH INT CONF ON DEPENDABLE, AUTONOM AND SECURE COMP / IEEE 17TH INT CONF ON PERVAS INTELLIGENCE AND COMP / IEEE 5TH INT CONF ON CLOUD AND BIG DATA COMP / IEEE 4TH CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2019, : 370 - 375
  • [2] Emotion in human-robot interaction: Recognition and display
    Wendt, Cornalia
    Kuehnlenz, Kolja
    Popp, Michael
    Karg, Michella
    [J]. INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2008, 43 (3-4) : 578 - 578
  • [3] Speech emotion recognition in real static and dynamic human-robot interaction scenarios
    Grageda, Nicolas
    Busso, Carlos
    Alvarado, Eduardo
    Garcia, Ricardo
    Mahu, Rodrigo
    Huenupan, Fernando
    Yoma, Nestor Becerra
    [J]. COMPUTER SPEECH AND LANGUAGE, 2025, 89
  • [4] Speech Emotion Recognition Using an Enhanced Kernel Isomap for Human-Robot Interaction
    Zhang, Shiqing
    Zhao, Xiaoming
    Lei, Bicheng
    [J]. INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2013, 10
  • [5] Emotion Recognition in Human-Robot Interaction Using the NAO Robot
    Valagkouti, Iro Athina
    Troussas, Christos
    Krouska, Akrivi
    Feidakis, Michalis
    Sgouropoulou, Cleo
    [J]. COMPUTERS, 2022, 11 (05)
  • [6] Object recognition through human-robot interaction by speech
    Kurnia, R
    Hossain, A
    Nakamura, A
    Kuno, Y
    [J]. RO-MAN 2004: 13TH IEEE INTERNATIONAL WORKSHOP ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, PROCEEDINGS, 2004, : 619 - 624
  • [7] Speech emotion recognition with deep convolutional neural networks
    Issa, Dias
    Demirci, M. Fatih
    Yazici, Adnan
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 59 (59)
  • [8] Interaction Intention Recognition via Human Emotion for Human-Robot Natural Interaction
    Yang, Shengtian
    Guan, Yisheng
    Li, Yihui
    Shi, Wenjing
    [J]. 2022 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2022, : 380 - 385
  • [9] Human Activity Recognition through Recurrent Neural Networks for Human-Robot Interaction in Agriculture
    Anagnostis, Athanasios
    Benos, Lefteris
    Tsaopoulos, Dimitrios
    Tagarakis, Aristotelis
    Tsolakis, Naoum
    Bochtis, Dionysis
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (05): : 1 - 21
  • [10] Minimal representation of speech signals for generation of emotion speech and human-robot interaction
    Lee, Heyoung
    Bien, Z. Zenn
    [J]. 2007 RO-MAN: 16TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, VOLS 1-3, 2007, : 137 - +