Continuous affect recognition with weakly supervised learning

被引:8
|
作者
Pei, Ercheng [1 ]
Jiang, Dongmei [1 ]
Alioscha-Perez, Mitchel [2 ]
Sahli, Hichem [2 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, VUB NPU Joint AVSP Lab, Xian 710072, Peoples R China
[2] Vrije Univ Brussel, Dept ETRO, Pl Laan 2, B-1050 Brussels, Belgium
关键词
Continuous affect recognition; DNN-BLSTM; Weak supervision; FEATURE ENHANCEMENT; LSTM; CLASSIFICATION; NETWORKS; FEATURES;
D O I
10.1007/s11042-019-7313-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recognizing a person's affective state from audio-visual signals is an essential capability for intelligent interaction. Insufficient training data and the unreliable labels of affective dimensions (e.g., valence and arousal) are two major challenges in continuous affect recognition. In this paper, we propose a weakly supervised learning approach based on hybrid deep neural network and bidirectional long short-term memory recurrent neural network (DNN-BLSTM). It firstly maps the audio/visual features into a more discriminative space via the powerful modelling capacities of DNN, then models the temporal dynamics of affect via BLSTM. To reduce the negative impact of the unreliable labels, we utilize a temporal label (TL) along with a robust loss function (RL) for incorporating weak supervision into the learning process of the DNN-BLSTM model. Therefore, the proposed method not only has a simpler structure than the deep BLSTM model in He et al. (24) which requires more training data, but also is robust to noisy and unreliable labels. Single modal and multimodal affect recognition experiments have been carried out on the RECOLA dataset. Single modal recognition results show that the proposed method with TL and RL obtains remarkable improvements on both arousal and valence in terms of concordance correlation coefficient (CCC), while multimodal recognition results show that with less feature streams, our proposed approach obtains better or comparable results with the state-of-the-art methods.
引用
收藏
页码:19387 / 19412
页数:26
相关论文
共 50 条
  • [1] Continuous affect recognition with weakly supervised learning
    Ercheng Pei
    Dongmei Jiang
    Mitchel Alioscha-Perez
    Hichem Sahli
    [J]. Multimedia Tools and Applications, 2019, 78 : 19387 - 19412
  • [2] An Investigation of Cross-Cultural Semi-Supervised Learning for Continuous Affect Recognition
    Mallol-Ragolta, Adria
    Cummins, Nicholas
    Schuller, Bjoern W.
    [J]. INTERSPEECH 2020, 2020, : 511 - 515
  • [3] Weakly Supervised Learning: Application to Fish School Recognition
    Lefort, Riwal
    Fablet, Ronan
    Boucher, Jean-Marc
    [J]. NEW ADVANCES IN INTELLIGENT SIGNAL PROCESSING, 2011, 372 : 203 - +
  • [4] Weakly Supervised Dual Learning for Facial Action Unit Recognition
    Wang, Shangfei
    Peng, Guozhu
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (12) : 3218 - 3230
  • [5] Weakly supervised graph learning for action recognition in untrimmed video
    Yao, Xiao
    Zhang, Jia
    Chen, Ruixuan
    Zhang, Dan
    Zeng, Yifeng
    [J]. VISUAL COMPUTER, 2023, 39 (11): : 5469 - 5483
  • [6] Medical Named Entity Recognition Using Weakly Supervised Learning
    Long-Long Ma
    Jie Yang
    Bo An
    Shuaikang Liu
    Gaijuan Huang
    [J]. Cognitive Computation, 2022, 14 : 1068 - 1079
  • [7] Learning Label Semantics for Weakly Supervised Group Activity Recognition
    Wu, Lifang
    Tian, Meng
    Xiang, Ye
    Gu, Ke
    Shi, Ge
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6386 - 6397
  • [8] Medical Named Entity Recognition Using Weakly Supervised Learning
    Ma, Long-Long
    Yang, Jie
    An, Bo
    Liu, Shuaikang
    Huang, Gaijuan
    [J]. COGNITIVE COMPUTATION, 2022, 14 (03) : 1068 - 1079
  • [9] Weakly supervised graph learning for action recognition in untrimmed video
    Xiao Yao
    Jia Zhang
    Ruixuan Chen
    Dan Zhang
    Yifeng Zeng
    [J]. The Visual Computer, 2023, 39 : 5469 - 5483
  • [10] Weakly supervised foreground learning for weakly supervised localization and detection
    Zhang, Chen -Lin
    Li, Yin
    Wu, Jianxin
    [J]. PATTERN RECOGNITION, 2023, 137