Semi-supervised Emotion Recognition using Inconsistently Annotated Data

被引:2
|
作者
Happy, S. L. [1 ]
Dantcheva, Antitza [1 ]
Bremond, Francois [1 ]
机构
[1] Univ Cote Azur, Inria, Nice, France
关键词
FACIAL EXPRESSION RECOGNITION;
D O I
10.1109/FG47880.2020.00075
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Expression recognition remains challenging, predominantly due to (a) lack of sufficient data, (b) subtle emotion intensity, (c) subjective and inconsistent annotation, as well as due to (d) in-the-wild data containing variations in pose, intensity, and occlusion. To address such challenges in a unified framework, we propose a self-training based semi-supervised convolutional neural network (CNN) framework, which directly addresses the problem of (a) limited data by leveraging information from unannotated samples. Our method uses 'successive label smoothing' to adapt to the subtle expressions and improve the model performance for (b) low-intensity expression samples. Further, we address (c) inconsistent annotations by assigning sample weights during loss computation, thereby ignoring the effect of incorrect ground-truth. We observe significant performance improvement in in-the-wild datasets by leveraging the information from the in-the-lab datasets, related to challenge (d). Associated to that, experiments on four publicly available datasets demonstrate large performance gains in cross-database performance, as well as show that the proposed method achieves to learn different expression intensities, even when trained with categorical samples.
引用
收藏
页码:286 / 293
页数:8
相关论文
共 50 条
  • [1] Speech emotion recognition using semi-supervised discriminant analysis
    Zhao, L. (zhaoli@seu.edu.cn), 1600, Southeast University (30):
  • [2] Semi-supervised Model for Emotion Recognition in Speech
    Pereira, Ingryd
    Santos, Diego
    Maciel, Alexandre
    Barros, Pablo
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT I, 2018, 11139 : 791 - 800
  • [3] Emotion recognition using semi-supervised feature selection with speaker normalization
    Sun Y.
    Wen G.
    International Journal of Speech Technology, 2015, 18 (3) : 317 - 331
  • [4] Speech Emotion Recognition Using Semi-supervised Learning with Ladder Networks
    Huang, Jian
    Li, Ya
    Tao, Jianhua
    Lian, Zheng
    Niu, Mingyue
    Yi, Jiangyan
    2018 FIRST ASIAN CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII ASIA), 2018,
  • [5] Semi-supervised Ladder Networks for Speech Emotion Recognition
    Jian-Hua Tao
    Jian Huang
    Ya Li
    Zheng Lian
    Ming-Yue Niu
    International Journal of Automation and Computing, 2019, 16 : 437 - 448
  • [6] Semi-Supervised Speech Emotion Recognition With Ladder Networks
    Parthasarathy, Srinivas
    Busso, Carlos
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2697 - 2709
  • [7] Semi-supervised Ladder Networks for Speech Emotion Recognition
    Tao, Jian-Hua
    Huang, Jian
    Li, Ya
    Lian, Zheng
    Niu, Ming-Yue
    INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2019, 16 (04) : 437 - 448
  • [8] Semi-Supervised Speech Emotion Recognition with Ladder Networks
    Parthasarathy, Srinivas
    Busso, Carlos
    IEEE/ACM Transactions on Audio Speech and Language Processing, 2020, 28 : 2697 - 2709
  • [9] Semi-supervised Ladder Networks for Speech Emotion Recognition
    Jian-Hua Tao
    Jian Huang
    Ya Li
    Zheng Lian
    Ming-Yue Niu
    International Journal of Automation and Computing, 2019, 16 (04) : 437 - 448
  • [10] Semi-Supervised Multimodal Emotion Recognition with Expression MAE
    Cheng, Zebang
    Lin, Yuxiang
    Chen, Zhaoru
    Li, Xiang
    Mao, Shuyi
    Zhang, Fan
    Ding, Daijun
    Zhang, Bowen
    Peng, Xiaojiang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9436 - 9440