Feature Fusion for Multimodal Emotion Recognition Based on Deep Canonical Correlation Analysis

被引:19
|
作者
Zhang, Ke [1 ,2 ]
Li, Yuanqing [1 ,2 ]
Wang, Jingyu [3 ]
Wang, Zhen [4 ]
Li, Xuelong [4 ]
机构
[1] Northwestern Polytech Univ, Natl Key Lab Aerosp Flight Dynam, Xian 710072, Shaanxi, Peoples R China
[2] Northwestern Polytech Univ, Sch Astronaut, Xian 710072, Shaanxi, Peoples R China
[3] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect iOPEN, Sch Astronaut, Xian 710072, Shaanxi, Peoples R China
[4] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect iOPEN, Xian 710072, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Correlation; Emotion recognition; TV; Visualization; Analytical models; Logic gates; Deep canonical correlation analysis; gated recurrent unit; multimodal emotion recognition;
D O I
10.1109/LSP.2021.3112314
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Fusion of multimodal features is a momentous problem for video emotion recognition. As the development of deep learning, directly fusing feature matrixes of each mode through neural networks at feature level becomes mainstream method. However, unlike unimodal issues, for multimodal analysis, finding the correlations between different modal is as important as discovering effective unimodal features. To make up the deficiency in unearthing the intrinsic relationships between multimodal, a novel modularized multimodal emotion recognition model based on deep canonical correlation analysis (MERDCCA) is proposed in this letter. In MERDCCA, four utterances are gathered as a new group and each utterance contains text, audio and visual information as multimodal input. Gated recurrent unit layers are used to extract the unimodal features. Deep canonical correlation analysis based on encoder-decoder network is designed to extract cross-modal correlations by maximizing the relevance between multimodal. The experiments on two public datasets show that MERDCCA achieves the better results.
引用
收藏
页码:1898 / 1902
页数:5
相关论文
共 50 条
  • [1] Canonical Correlation Analysis for Data Fusion in Multimodal Emotion Recognition
    Nemati, Shahla
    [J]. 2018 9TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2018, : 676 - 681
  • [2] Research on Feature Fusion for Emotion Recognition Based on Discriminative Canonical Correlation Analysis
    ChuqiLiu
    Li, Chao
    ZipingZhao
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON MATHEMATICS AND ARTIFICIAL INTELLIGENCE (ICMAI 2018), 2018, : 30 - 36
  • [3] Multimodal emotion recognition based on kernel canonical correlation analysis
    Li, Bo
    Qi, Lin
    Gao, Lei
    [J]. 2014 IEEE WORKSHOP ON ELECTRONICS, COMPUTER AND APPLICATIONS, 2014, : 934 - 937
  • [4] Multimodal Emotion Recognition Based on Feature Fusion
    Xu, Yurui
    Wu, Xiao
    Su, Hang
    Liu, Xiaorui
    [J]. 2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 7 - 11
  • [5] Deep Feature Extraction and Attention Fusion for Multimodal Emotion Recognition
    Yang, Zhiyi
    Li, Dahua
    Hou, Fazheng
    Song, Yu
    Gao, Qiang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (03) : 1526 - 1530
  • [6] Multimodal Emotion Recognition Using Deep Generalized Canonical Correlation Analysis with an Attention Mechanism
    Lan, Yu-Ting
    Liu, Wei
    Lu, Bao-Liang
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [7] Feature fusion method based on canonical correlation analysis and handwritten character recognition
    Sun, QS
    Zeng, SG
    Heng, PA
    Xia, DS
    [J]. 2004 8TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1-3, 2004, : 1547 - 1552
  • [8] SPARSITY PRESERVING MULTIPLE CANONICAL CORRELATION ANALYSIS WITH VISUAL EMOTION RECOGNITION TO MULTI-FEATURE FUSION
    Gao, Lei
    Qi, Lin
    Guan, Ling
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 2710 - 2714
  • [9] Feature Fusion of Speech Emotion Recognition Based on Deep Learning
    Liu, Gang
    He, Wei
    Jin, Bicheng
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 193 - 197
  • [10] Palmprint and Palm Vein Feature Fusion Recognition Based on BSLDP and Canonical Correlation Analysis
    Li Xinchun
    Zhang Chunhua
    Lin Sen
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2018, 55 (05)