Low-rank Multimodal Fusion Algorithm Based on Context Modeling

被引:6
|
作者
Bai, Zongwen [1 ]
Chen, Xiaohuan [1 ]
Zhou, Meili [1 ]
Yi, Tingting [1 ]
Chien, Wei-Che [2 ]
机构
[1] Yanan Univ, Sch Phys & Elect Informat, Yanan, Peoples R China
[2] Natl Dong Hwa Univ, Dept Comp Sci & Informat Engn, Hualien, Taiwan
来源
JOURNAL OF INTERNET TECHNOLOGY | 2021年 / 22卷 / 04期
基金
中国国家自然科学基金;
关键词
Neural architecture search; Sequence regression models; Performance prediction; Network structure feature; NETWORK; COMMUNICATION;
D O I
10.53106/160792642021072204018
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As an important part of human daily life, video contains rich emotion information. Therefore, it is a current research trend to find efficient approaches to conducting emotional analysis on videos. Based on tensor fusion, we propose a low-rank multimodal fusion context modeling. At the beginning, modality information is preprocessed by GRU (Gate Recurrent Unit) in Recurrent Neural Network. We construct semantic dependencies to convey contextual information in the context of the video. The proposed model improves performance of applied emotion classification. Additionally, LMF (Low-rank Tensor Multimodal Fusion) with the advantage of end-toend learning is implemented as a fusion mechanism to improve classification efficiency. We implemented the experiments on CMU-MOSI, POM, and IEMOCAP of multi-modal sentiment analysis, speaker traits and emotion recognition. And results show that our method improved the performance by a margin of 2.9%, 1.3%, and 12.2% respectively contrast with TFN (Tensor Fusion Network).
引用
收藏
页码:913 / 921
页数:9
相关论文
共 50 条
  • [21] A link prediction algorithm based on low-rank matrix completion
    Gao, Man
    Chen, Ling
    Li, Bin
    Liu, Wei
    APPLIED INTELLIGENCE, 2018, 48 (12) : 4531 - 4550
  • [22] A low-rank tensor-based algorithm for face recognition
    Lita, Lacramioara
    Pelican, Elena
    APPLIED MATHEMATICAL MODELLING, 2015, 39 (3-4) : 1266 - 1274
  • [23] Multimodal Score Fusion with Sparse Low-rank Bilinear Pooling for Egocentric Hand Action Recognition
    Roy, Kankana
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (07)
  • [24] Latent Low-rank Graph Learning for Multimodal Clustering
    Zhong, Guo
    Pun, Chi-Man
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 492 - 503
  • [25] MM-LMF: A Low-Rank Multimodal Fusion Dangerous Driving Behavior Recognition Method Based on FMCW Signals
    Hao, Zhanjun
    Li, Zepei
    Dang, Xiaochao
    Ma, Zhongyu
    Liu, Gaoyuan
    ELECTRONICS, 2022, 11 (22)
  • [26] Joint low-rank tensor fusion and cross-modal attention for multimodal physiological signals based emotion recognition
    Wan, Xin
    Wang, Yongxiong
    Wang, Zhe
    Tang, Yiheng
    Liu, Benke
    PHYSIOLOGICAL MEASUREMENT, 2024, 45 (07)
  • [27] Optimized Multimodal Neurological Image Fusion Based on Low-Rank Texture Prior Decomposition and Super-Pixel Segmentation
    Das, Manisha
    Gupta, Deep
    Radeva, Petia
    Bakde, Ashwini M.
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [28] Multispectral and hyperspectral image fusion based on low-rank unfolding network
    Yan, Jun
    Zhang, Kai
    Zhang, Feng
    Ge, Chiru
    Wan, Wenbo
    Sun, Jiande
    SIGNAL PROCESSING, 2023, 213
  • [29] Multimodal Context Fusion Based Dense Video Captioning Algorithm
    Li, Meiqi
    Zhou, Ziwei
    ENGINEERING LETTERS, 2025, 33 (04) : 1061 - 1072
  • [30] Adaptive Rank Estimation Based Tensor Factorization Algorithm for Low-Rank Tensor Completion
    Liu, Han
    Liu, Jing
    Su, Liyu
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 3444 - 3449