Distant speech emotion recognition in an indoor human-robot interaction scenario

被引:0
|
作者
Grageda, Nicolas [1 ]
Busso, Carlos [2 ]
Alvarado, Eduardo [1 ]
Mahu, Rodrigo [1 ]
Yoma, Nestor Becerra [1 ]
机构
[1] Univ Chile, Dept Elect Engn, Speech Proc & Transmiss Lab, Santiago, Chile
[2] Univ Texas Dallas, Dept Elect & Comp Engn, MSP Lab, Richardson, TX 75083 USA
来源
关键词
speech emotion recognition; human-computer interaction;
D O I
10.21437/Interspeech.2023-1169
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Social robotics and human-robot partnership are becoming very relevant topics defining many challenges for state-of-the-art speech technology. This paper presents the first evaluation of speech emotion recognition (SER) technology with non-acted speech data recorded in a real indoor human-robot interaction (HRI) scenario. The challenge is typified by distant speech processing, reverberation, and additive external and robot engine noise. We train and evaluate a machine learning-based based on simulated acoustic modelling that includes room impulse responses (RIRs), external noise, and beamforming response. We observe increased performance in the prediction of arousal, valence, and dominance with the proposed training procedure combined with delay-and-sum and minimum variance distortionless response (MVDR), with gain as high as 180%, compared with the result obtained with the model trained with the original data in controlled environments. Moreover, the degradation achieved when compared with the original matched training/testing condition is just 39%.
引用
收藏
页码:3657 / 3661
页数:5
相关论文
共 50 条
  • [1] Emotion Recognition From Speech to Improve Human-robot Interaction
    Zhu, Changrui
    Ahamd, Wasim
    IEEE 17TH INT CONF ON DEPENDABLE, AUTONOM AND SECURE COMP / IEEE 17TH INT CONF ON PERVAS INTELLIGENCE AND COMP / IEEE 5TH INT CONF ON CLOUD AND BIG DATA COMP / IEEE 4TH CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2019, : 370 - 375
  • [2] Emotion in human-robot interaction: Recognition and display
    Wendt, Cornalia
    Kuehnlenz, Kolja
    Popp, Michael
    Karg, Michella
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2008, 43 (3-4) : 578 - 578
  • [3] Speech emotion recognition in real static and dynamic human-robot interaction scenarios
    Grageda, Nicolas
    Busso, Carlos
    Alvarado, Eduardo
    Garcia, Ricardo
    Mahu, Rodrigo
    Huenupan, Fernando
    Yoma, Nestor Becerra
    COMPUTER SPEECH AND LANGUAGE, 2025, 89
  • [4] Speech Emotion Recognition Using an Enhanced Kernel Isomap for Human-Robot Interaction
    Zhang, Shiqing
    Zhao, Xiaoming
    Lei, Bicheng
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2013, 10
  • [5] On the Robustness of Speech Emotion Recognition for Human-Robot Interaction with Deep Neural Networks
    Lakomkin, Egor
    Zamani, Mohammad Ali
    Weber, Cornelius
    Magg, Sven
    Wermter, Stefan
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 854 - 860
  • [6] Multi-party Human-Robot Interaction with Distant-Talking Speech Recognition
    Gomez, Randy
    Kawahara, Tatsuya
    Nakamura, Keisuke
    Nakadai, Kazuhiro
    HRI'12: PROCEEDINGS OF THE SEVENTH ANNUAL ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2012, : 439 - 446
  • [7] Emotion Recognition in Human-Robot Interaction Using the NAO Robot
    Valagkouti, Iro Athina
    Troussas, Christos
    Krouska, Akrivi
    Feidakis, Michalis
    Sgouropoulou, Cleo
    COMPUTERS, 2022, 11 (05)
  • [8] Object recognition through human-robot interaction by speech
    Kurnia, R
    Hossain, A
    Nakamura, A
    Kuno, Y
    RO-MAN 2004: 13TH IEEE INTERNATIONAL WORKSHOP ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, PROCEEDINGS, 2004, : 619 - 624
  • [9] Interaction Intention Recognition via Human Emotion for Human-Robot Natural Interaction
    Yang, Shengtian
    Guan, Yisheng
    Li, Yihui
    Shi, Wenjing
    2022 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2022, : 380 - 385
  • [10] Minimal representation of speech signals for generation of emotion speech and human-robot interaction
    Lee, Heyoung
    Bien, Z. Zenn
    2007 RO-MAN: 16TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, VOLS 1-3, 2007, : 137 - +