Continuous affect recognition with weakly supervised learning

被引:8
|
作者
Pei, Ercheng [1 ]
Jiang, Dongmei [1 ]
Alioscha-Perez, Mitchel [2 ]
Sahli, Hichem [2 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, VUB NPU Joint AVSP Lab, Xian 710072, Peoples R China
[2] Vrije Univ Brussel, Dept ETRO, Pl Laan 2, B-1050 Brussels, Belgium
关键词
Continuous affect recognition; DNN-BLSTM; Weak supervision; FEATURE ENHANCEMENT; LSTM; CLASSIFICATION; NETWORKS; FEATURES;
D O I
10.1007/s11042-019-7313-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recognizing a person's affective state from audio-visual signals is an essential capability for intelligent interaction. Insufficient training data and the unreliable labels of affective dimensions (e.g., valence and arousal) are two major challenges in continuous affect recognition. In this paper, we propose a weakly supervised learning approach based on hybrid deep neural network and bidirectional long short-term memory recurrent neural network (DNN-BLSTM). It firstly maps the audio/visual features into a more discriminative space via the powerful modelling capacities of DNN, then models the temporal dynamics of affect via BLSTM. To reduce the negative impact of the unreliable labels, we utilize a temporal label (TL) along with a robust loss function (RL) for incorporating weak supervision into the learning process of the DNN-BLSTM model. Therefore, the proposed method not only has a simpler structure than the deep BLSTM model in He et al. (24) which requires more training data, but also is robust to noisy and unreliable labels. Single modal and multimodal affect recognition experiments have been carried out on the RECOLA dataset. Single modal recognition results show that the proposed method with TL and RL obtains remarkable improvements on both arousal and valence in terms of concordance correlation coefficient (CCC), while multimodal recognition results show that with less feature streams, our proposed approach obtains better or comparable results with the state-of-the-art methods.
引用
收藏
页码:19387 / 19412
页数:26
相关论文
共 50 条
  • [31] GEOGRAPHIC INFORMATION USE IN WEAKLY-SUPERVISED DEEP LEARNING FOR LANDMARK RECOGNITION
    Yin, Yifang
    Liu, Zhenguang
    Zimmermann, Roger
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 1015 - 1020
  • [32] Weakly-Supervised Cross-Domain Dictionary Learning for Visual Recognition
    Fan Zhu
    Ling Shao
    International Journal of Computer Vision, 2014, 109 : 42 - 59
  • [33] Semi-Supervised Learning for Continuous Emotion Recognition Based on Metric Learning
    Choi, Dong Yoon
    Song, Byung Cheol
    IEEE ACCESS, 2020, 8 : 113443 - 113455
  • [34] UntrimmedNets for Weakly Supervised Action Recognition and Detection
    Wang, Limin
    Xiong, Yuanjun
    Lin, Dahua
    Van Gool, Luc
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6402 - 6411
  • [35] A weakly supervised approach for recycling code recognition
    Pellegrini, Lorenzo
    Maltoni, Davide
    Graffieti, Gabriele
    Lomonaco, Vincenzo
    Mazzini, Lisa
    Mondardini, Marco
    Zappoli, Milena
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 215
  • [36] Weakly Supervised Transformer for Radar Jamming Recognition
    Zhang, Menglu
    Chen, Yushi
    Zhang, Ye
    REMOTE SENSING, 2024, 16 (14)
  • [37] Weakly Supervised Attention Networks for Entity Recognition
    Patra, Barun
    Moniz, Joel Ruben Antony
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 6268 - 6273
  • [38] Continuous Sign Language Recognition Based on Pseudo-supervised Learning
    Pei, Xiankun
    Guo, Dan
    Zhao, Ye
    PROCEEDINGS OF THE 2ND WORKSHOP ON MULTIMEDIA FOR ACCESSIBLE HUMAN COMPUTER INTERFACES (MAHCI '19), 2019, : 33 - 39
  • [39] Weakly Supervised Human Activity Recognition From Wearable Sensors by Recurrent Attention Learning
    He, Jun
    Zhang, Qian
    Wang, Liqun
    Pei, Ling
    IEEE SENSORS JOURNAL, 2019, 19 (06) : 2287 - 2297
  • [40] Weakly Supervised Metric Learning for Traffic Sign Recognition in a LIDAR-Equipped Vehicle
    Tan, Min
    Wang, Baoyuan
    Wu, Zhaohui
    Wang, Jingdong
    Pan, Gang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2016, 17 (05) : 1415 - 1427