Multimodal shared features learning for emotion recognition by enhanced sparse local discriminative canonical correlation analysis

被引:13
|
作者
Fu, Jiamin [1 ]
Mao, Qirong [1 ]
Tu, Juanjuan [2 ]
Zhan, Yongzhao [1 ]
机构
[1] Jiangsu Univ, Sch Comp Sci & Commun Engn, Zhenjiang, Jiangsu, Peoples R China
[2] Jiangsu Univ Sci & Technol, Sch Comp Sci & Engn, Zhenjiang, Jiangsu, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Multimodal emotion recognition; Multimodal shared feature learning; Multimodal information fusion; Canonical correlation analysis;
D O I
10.1007/s00530-017-0547-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multimodal emotion recognition is a challenging research topic which has recently started to attract the attention of the research community. To better recognize the video users' emotion, the research of multimodal emotion recognition based on audio and video is essential. Multimodal emotion recognition performance heavily depends on finding good shared feature representation. The good shared representation needs to consider two aspects: (1) it has the character of each modality and (2) it can balance the effect of different modalities to make the decision optimal. In the light of these, we propose a novel Enhanced Sparse Local Discriminative Canonical Correlation Analysis approach (En-SLDCCA) to learn the multimodal shared feature representation. The shared feature representation learning involves two stages. In the first stage, we pretrain the Sparse Auto-Encoder with unimodal video (or audio), so that we can obtain the hidden feature representation of video and audio separately. In the second stage, we obtain the correlation coefficients of video and audio using our En-SLDCCA approach, then we form the shared feature representation which fuses the features from video and audio using the correlation coefficients. We evaluate the performance of our method on the challenging multimodal Enterface'05 database. Experimental results reveal that our method is superior to the unimodal video (or audio) and improves significantly the performance for multimodal emotion recognition when compared with the current state of the art.
引用
收藏
页码:451 / 461
页数:11
相关论文
共 50 条
  • [41] Multimodal data fusion using sparse canonical correlation analysis and cooperative learning: a COVID-19 cohort study
    Er, Ahmet Gorkem
    Ding, Daisy Yi
    Er, Berrin
    Uzun, Mertcan
    Cakmak, Mehmet
    Sadee, Christoph
    Durhan, Gamze
    Ozmen, Mustafa Nasuh
    Tanriover, Mine Durusu
    Topeli, Arzu
    Aydin Son, Yesim
    Tibshirani, Robert
    Unal, Serhat
    Gevaert, Olivier
    NPJ DIGITAL MEDICINE, 2024, 7 (01)
  • [42] Multimodal recognition of posed ear and face based on kernel canonical correlation analysis
    Wang, Yu
    Mu, Zhichun
    Xu, Zhengguang
    Luo, Jiajia
    Beijing Keji Daxue Xuebao/Journal of University of Science and Technology Beijing, 2008, 30 (10): : 1200 - 1204
  • [43] Multi-view Emotion Recognition Using Deep Canonical Correlation Analysis
    Qiu, Jie-Lin
    Liu, Wei
    Lu, Bao-Liang
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT V, 2018, 11305 : 221 - 231
  • [44] Speech Emotion Recognition Using Canonical Correlation Analysis and Probabilistic Neural Network
    Cen, Ling
    Ser, Wee
    Yu, Zhu Liang
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, : 859 - +
  • [45] K-Means Clustering-Based Kernel Canonical Correlation Analysis for Multimodal Emotion Recognition in Human-Robot Interaction
    Chen, Luefeng
    Wang, Kuanlin
    Li, Min
    Wu, Min
    Pedrycz, Witold
    Hirota, Kaoru
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 70 (01) : 1016 - 1024
  • [46] Adaptive structured sparse multiview canonical correlation analysis for multimodal brain imaging association identification
    Du, Lei
    Wang, Huiai
    Zhang, Jin
    Zhang, Shu
    Guo, Lei
    Han, Junwei
    SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (04)
  • [47] Adaptive structured sparse multiview canonical correlation analysis for multimodal brain imaging association identification
    Lei Du
    Huiai Wang
    Jin Zhang
    Shu Zhang
    Lei Guo
    Junwei Han
    Science China Information Sciences, 2023, 66
  • [48] Adaptive structured sparse multiview canonical correlation analysis for multimodal brain imaging association identification
    Lei DU
    Huiai WANG
    Jin ZHANG
    Shu ZHANG
    Lei GUO
    Junwei HAN
    the Alzheimer's Disease Neuroimaging Initiative
    Science China(Information Sciences), 2023, 66 (04) : 212 - 227
  • [49] Integrating Multimodal Neuroimaging and Genetics: A Structurally-Linked Sparse Canonical Correlation Analysis Approach
    Chung, Jiwon
    Kim, Sunghun
    Won, Ji Hye
    Park, Hyunjin
    IEEE Journal of Translational Engineering in Health and Medicine, 2024, 12 : 659 - 667
  • [50] Graph Enhanced Canonical Correlation Analysis and Its Application to Image Recognition
    Su Shuzhi
    Xie Jun
    Ping Xinrui
    Gao Penglian
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (11) : 3342 - 3349