Detection of Emotional Hotspots in Meetings Using a Cross-Corpus Approach

被引:0
|
作者
Stemmer, Georg [1 ]
Lopez Meyer, Paulo [2 ]
Del Hoyo Ontiveros, Juan [2 ]
Lopez, Jose A. [3 ]
Cordourier Maruri, Hector A. [2 ]
Bocklet, Tobias [1 ]
机构
[1] Intel Corp, Neubiberg, Germany
[2] Intel Corp, Mexico City, DF, Mexico
[3] Intel Corp, Santa Clara, CA USA
来源
关键词
emotion recognition; human-computer interaction; computational paralinguistics; RECOGNITION;
D O I
10.21437/Interspeech.2023-1023
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech emotion recognition for natural human-to-human conversations has many useful applications, including generating comprehensive meeting transcripts or detecting communication problems. We investigate the detection of emotional hotspots, i.e., regions of increased speaker involvement in technical meetings. As there is a scarcity of annotated, not-acted corpora, and to avoid introducing unwanted biases to our models, we follow a cross-corpus approach where models are trained on data from domains unrelated to the test data. In this work we propose a model ensemble trained on spontaneous phone conversations, political discussions and acted emotions. Evaluation is performed on the natural ICSI and AMI meeting corpora, where we used existing hotspot annotations for ICSI and created labels for the AMI corpus. A semi-supervised fine-tuning procedure is introduced to adapt the model. We show that an equal error rate of below 21% can be achieved using the proposed cross-corpus approach.
引用
收藏
页码:1020 / 1024
页数:5
相关论文
共 50 条
  • [41] UNSUPERVISED CROSS-CORPUS SPEECH EMOTION RECOGNITION USING DOMAIN-ADAPTIVE SUBSPACE LEARNING
    Liu, Na
    Zong, Yuan
    Zhang, Baofeng
    Liu, Li
    Chen, Jie
    Zhao, Guoying
    Zhu, Junchao
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5144 - 5148
  • [42] Cross-corpus Speech Emotion Recognition Using Transfer Semi-supervised Discriminant Analysis
    Song, Peng
    Zhang, Xinran
    Ou, Shifeng
    Liu, Jingjing
    Yu, Yanwei
    Zheng, Wenming
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [43] CROSS-CORPUS ACOUSTIC EMOTION RECOGNITION FROM SINGING AND SPEAKING: A MULTI-TASK LEARNING APPROACH
    Zhang, Biqiao
    Provost, Emily Mower
    Essl, Georg
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5805 - 5809
  • [44] A Cross-Corpus Speech-Based Analysis of Escalating Negative Interactions
    Lefter, Iulia
    Baird, Alice
    Stappen, Lukas
    Schuller, Bjorn W.
    FRONTIERS IN COMPUTER SCIENCE, 2022, 4
  • [45] Implicitly Aligning Joint Distributions for Cross-Corpus Speech Emotion Recognition
    Lu, Cheng
    Zong, Yuan
    Tang, Chuangao
    Lian, Hailun
    Chang, Hongli
    Zhu, Jie
    Li, Sunan
    Zhao, Yan
    ELECTRONICS, 2022, 11 (17)
  • [46] Real-world PTSD Recognition: A Cross-corpus and Cross-linguistic Evaluation
    Kathani, Alexander
    Buerger, Martin
    Triantafyllopoulos, Andreas
    Milkus, Sabrina
    Hohmann, Jonas
    Muderlak, Pauline
    Schottdorr, Jurgen
    Musil, Richard
    Schuller, Bjorn W.
    Amiriparian, Shahin
    INTERSPEECH 2024, 2024, : 487 - 491
  • [47] Synthesized speech for model training in cross-corpus recognition of human emotion
    Schuller, Bjorn
    Zhang, Zixing
    Weninger, Felix
    Burkhardt, Felix
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (03) : 313 - 323
  • [48] Robust Transferable Subspace Learning for Cross-Corpus Facial Expression Recognition
    Chen, Dongliang
    Song, Peng
    Zhang, Wenjing
    Zhang, Weijian
    Xu, Bingui
    Zhou, Xuan
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (10): : 2241 - 2245
  • [49] Cross-Corpus Speech Emotion Recognition Based on Hybrid Neural Networks
    Rehman, Abdul
    Liu, Zhen-Tao
    Li, Dan-Yun
    Wu, Bao-Han
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7464 - 7468
  • [50] DOMAIN GENERALIZATION WITH TRIPLET NETWORK FOR CROSS-CORPUS SPEECH EMOTION RECOGNITION
    Lee, Shi-wook
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 389 - 396