ORDINAL LEARNING FOR EMOTION RECOGNITION IN CUSTOMER SERVICE CALLS

被引:0
|
作者
Han, Wenjing [1 ]
Jiang, Tao [1 ]
Li, Yan [1 ]
Schuller, Bjoern [2 ,3 ]
Ruan, Huabin [4 ]
机构
[1] Kuaishou Technol Corp, Beijing, Peoples R China
[2] Imperial Coll London, GLAM Grp Language Audio & Mus, London, England
[3] Univ Augsburg, ZD B Chair Embedded Intelligence Hlth Care & Well, Augsburg, Germany
[4] Tsinghua Univ, Sch Life Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
speech emotion recognition; ordinal classification; consistent rank logits; VGGish;
D O I
10.1109/icassp40776.2020.9053648
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Approaches toward ordinal speech emotion recognition (SER) tasks are commonly based on the categorical classification algorithms, where the rank-order emotions are arbitrarily treated as independent categories. To employ the ordinal information between emotional ranks, we propose to model the ordinal SER tasks under a COnsistent RAnk Logits (CORAL) based deep learning framework. Specifically, a multi-class ordinal SER task is transformed into a series of binary SER sub-tasks predicting whether an utterance's emotion is larger than a rank. All the sub-tasks are jointly solved by one single network with a mislabelling cost defined as the the sum of the individual cross-entropy loss for each sub-task. Having the VGGish as our basic network structure, via minimizing above CORAL based cost, a VGGish-CORAL network is implemented in this contribution. Experimental results on a real-world call center dataset and the widely used IEMOCAP corpus demonstrate the effectiveness of VGGish-CORAL compared to the categorical VGGish.
引用
收藏
页码:6494 / 6498
页数:5
相关论文
共 50 条
  • [41] Emotion Recognition in Speech with Deep Learning Architectures
    Erdal, Mehmet
    Kaechele, Markus
    Schwenker, Friedhelm
    [J]. ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, 2016, 9896 : 298 - 311
  • [42] Dimensional Music Emotion Recognition by Machine Learning
    Bai, Junjie
    Feng, Lixiao
    Peng, Jun
    Shi, Jinliang
    Luo, Kan
    Li, Zuojin
    Liao, Lu
    Wang, Yingxu
    [J]. INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE, 2016, 10 (04) : 74 - 89
  • [43] Emotion Recognition in E-learning Systems
    El Hammoumi, Oussama
    Benmarrakchi, Fatimaezzahra
    Ouherrou, Nihal
    El Kafi, Jamal
    El Hore, Ali
    [J]. PROCEEDINGS OF 2018 6TH INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2018, : 52 - 57
  • [44] Speech emotion recognition with unsupervised feature learning
    Zheng-wei Huang
    Wen-tao Xue
    Qi-rong Mao
    [J]. Frontiers of Information Technology & Electronic Engineering, 2015, 16 : 358 - 366
  • [45] Meta-transfer learning for emotion recognition
    Nguyen, Dung
    Nguyen, Duc Thanh
    Sridharan, Sridha
    Denman, Simon
    Nguyen, Thanh Thi
    Dean, David
    Fookes, Clinton
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (14): : 10535 - 10549
  • [46] Deep Learning and Audio Based Emotion Recognition
    Demir, Asli
    Atila, Orhan
    Sengur, Abdulkadir
    [J]. 2019 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP 2019), 2019,
  • [47] CONTRASTIVE UNSUPERVISED LEARNING FOR SPEECH EMOTION RECOGNITION
    Li, Mao
    Yang, Bo
    Levy, Joshua
    Stolcke, Andreas
    Rozgic, Viktor
    Matsoukas, Spyros
    Papayiannis, Constantinos
    Bone, Daniel
    Wang, Chao
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6329 - 6333
  • [48] Meta-transfer learning for emotion recognition
    Dung Nguyen
    Duc Thanh Nguyen
    Sridha Sridharan
    Simon Denman
    Thanh Thi Nguyen
    David Dean
    Clinton Fookes
    [J]. Neural Computing and Applications, 2023, 35 : 10535 - 10549
  • [49] Harmonious Mutual Learning for Facial Emotion Recognition
    Yanling Gan
    Luhui Xu
    Haiying Xia
    Gan Liu
    [J]. Neural Processing Letters, 56
  • [50] Speech Emotion Recognition with Discriminative Feature Learning
    Zhou, Huan
    Liu, Kai
    [J]. INTERSPEECH 2020, 2020, : 4094 - 4097