Low-rank Multimodal Fusion Algorithm Based on Context Modeling

被引:6
|
作者
Bai, Zongwen [1 ]
Chen, Xiaohuan [1 ]
Zhou, Meili [1 ]
Yi, Tingting [1 ]
Chien, Wei-Che [2 ]
机构
[1] Yanan Univ, Sch Phys & Elect Informat, Yanan, Peoples R China
[2] Natl Dong Hwa Univ, Dept Comp Sci & Informat Engn, Hualien, Taiwan
来源
JOURNAL OF INTERNET TECHNOLOGY | 2021年 / 22卷 / 04期
基金
中国国家自然科学基金;
关键词
Neural architecture search; Sequence regression models; Performance prediction; Network structure feature; NETWORK; COMMUNICATION;
D O I
10.53106/160792642021072204018
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As an important part of human daily life, video contains rich emotion information. Therefore, it is a current research trend to find efficient approaches to conducting emotional analysis on videos. Based on tensor fusion, we propose a low-rank multimodal fusion context modeling. At the beginning, modality information is preprocessed by GRU (Gate Recurrent Unit) in Recurrent Neural Network. We construct semantic dependencies to convey contextual information in the context of the video. The proposed model improves performance of applied emotion classification. Additionally, LMF (Low-rank Tensor Multimodal Fusion) with the advantage of end-toend learning is implemented as a fusion mechanism to improve classification efficiency. We implemented the experiments on CMU-MOSI, POM, and IEMOCAP of multi-modal sentiment analysis, speaker traits and emotion recognition. And results show that our method improved the performance by a margin of 2.9%, 1.3%, and 12.2% respectively contrast with TFN (Tensor Fusion Network).
引用
收藏
页码:913 / 921
页数:9
相关论文
共 50 条
  • [1] Dual Low-Rank Multimodal Fusion
    Jin, Tao
    Huang, Siyu
    Li, Yingming
    Zhang, Zhongfei
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 377 - 387
  • [2] Multimodal Medical Image Fusion Based on Multiple Latent Low-Rank Representation
    Lou, Xi-Cheng
    Feng, Xin
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2021, 2021
  • [3] Aspect Based Sentiment Analysis on Multimodal Data: A Transformer and Low-Rank Fusion Approach
    Jin, Meilin
    Shao, Lianhe
    Wang, Xihan
    Yan, Qianqian
    Chu, Zhulu
    Luo, Tongtong
    Tang, Jiacheng
    Gao, Quanli
    2024 4TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND ARTIFICIAL INTELLIGENCE, CCAI 2024, 2024, : 332 - 338
  • [4] Prediction of posterior capsular opacification based on heterogeneous low-rank multimodal fusion network
    Chen Z.-G.
    Wan Y.-J.
    Wang Y.-L.
    Jiang C.-L.
    Chen X.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2021, 55 (11): : 2045 - 2053
  • [5] Efficient Low-rank Multimodal Fusion with Modality-Specific Factors
    Liu, Zhun
    Shen, Ying
    Lakshminarasimhan, Varun Bharadhwaj
    Liang, Paul Pu
    Zadeh, Amir
    Morency, Louis-Philippe
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 2247 - 2256
  • [6] A Three-stage multimodal emotion recognition network based on text low-rank fusion
    Zhao, Linlin
    Yang, Youlong
    Ning, Tong
    MULTIMEDIA SYSTEMS, 2024, 30 (03)
  • [7] Multimodal Medical Image Fusion Based on Parameter Adaptive PCNN and Latent Low-rank Representation
    WANG Wenyan
    ZHOU Xianchun
    YANG Liangjian
    Instrumentation, 2023, 10 (01) : 45 - 58
  • [8] Low-rank and sparse matrices fitting algorithm for low-rank representation
    Zhao, Jianxi
    Zhao, Lina
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2020, 79 (02) : 407 - 425
  • [9] Low-rank isomap algorithm
    Mehrbani, Eysan
    Kahaei, Mohammad Hossein
    IET SIGNAL PROCESSING, 2022, 16 (05) : 528 - 545
  • [10] Low Rank Fusion based Transformers for Multimodal Sequences
    Sahay, Saurav
    Okur, Eda
    Kumar, Shachi H.
    Nachman, Lama
    PROCEEDINGS OF THE SECOND GRAND CHALLENGE AND WORKSHOP ON MULTIMODAL LANGUAGE (CHALLENGE-HML), VOL 1, 2020, : 29 - 34