Distant speech emotion recognition in an indoor human-robot interaction scenario

Authors
Grageda, Nicolas [1]
Busso, Carlos [2]
Alvarado, Eduardo [1]
Mahu, Rodrigo [1]
Yoma, Nestor Becerra [1]
Affiliations
[1] Univ Chile, Dept Elect Engn, Speech Proc & Transmiss Lab, Santiago, Chile
[2] Univ Texas Dallas, Dept Elect & Comp Engn, MSP Lab, Richardson, TX 75083 USA
Source
Keywords
speech emotion recognition; human-computer interaction
DOI
10.21437/Interspeech.2023-1169
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
Social robotics and human-robot partnership are becoming highly relevant topics that pose many challenges for state-of-the-art speech technology. This paper presents the first evaluation of speech emotion recognition (SER) technology with non-acted speech data recorded in a real indoor human-robot interaction (HRI) scenario. The challenge is typified by distant speech processing, reverberation, and additive external and robot engine noise. We train and evaluate a machine learning-based SER system using simulated acoustic modelling that includes room impulse responses (RIRs), external noise, and beamforming response. We observe increased performance in the prediction of arousal, valence, and dominance with the proposed training procedure combined with delay-and-sum and minimum variance distortionless response (MVDR) beamforming, with gains as high as 180% compared with the result obtained with the model trained on the original data recorded in controlled environments. Moreover, the degradation relative to the original matched training/testing condition is just 39%.
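The two signal-processing ideas named in the abstract, simulating distant speech with an RIR plus additive noise and combining microphone channels with delay-and-sum, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `simulate_distant_channel` and `delay_and_sum`, the use of a single-channel RIR, the SNR-based noise scaling, and the integer sample delays are all assumptions made for illustration. MVDR is not shown.

```python
import numpy as np


def simulate_distant_channel(clean, rir, noise, snr_db):
    """Hypothetical helper: reverberate clean speech and add noise at a target SNR.

    Assumes `noise` is at least as long as `clean`.
    """
    # Reverberation: convolve with the room impulse response, keep original length.
    reverberant = np.convolve(clean, rir)[: len(clean)]
    noise = noise[: len(reverberant)]
    # Scale the noise so that speech power / noise power matches snr_db.
    speech_power = np.mean(reverberant ** 2)
    noise_power = np.mean(noise ** 2)
    scale = np.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
    return reverberant + scale * noise


def delay_and_sum(channels, delays):
    """Hypothetical helper: delay-and-sum with non-negative integer sample delays.

    channels: array of shape (num_mics, num_samples)
    delays:   per-microphone arrival delay in samples, relative to the earliest mic
    """
    num_samples = channels.shape[1]
    out = np.zeros(num_samples)
    for channel, delay in zip(channels, delays):
        # Advance each channel by its delay so the target speech lines up.
        out[: num_samples - delay] += channel[delay:]
    return out / len(channels)
```

In such a sketch, training data for the distant condition would be produced by passing clean utterances through `simulate_distant_channel` once per microphone and then through `delay_and_sum`; a real system would typically use fractional delays estimated from the array geometry or from the signals themselves.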
Pages: 3657-3661
Number of pages: 5