Efficient Low-rank Multimodal Fusion with Modality-Specific Factors

Cited by: 0
Authors
Liu, Zhun [1]
Shen, Ying [1]
Lakshminarasimhan, Varun Bharadhwaj [1]
Liang, Paul Pu [1]
Zadeh, Amir [1]
Morency, Louis-Philippe [1]
Affiliation
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
Funding
National Science Foundation (USA)
Keywords
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Multimodal research is an emerging field of artificial intelligence, and one of the main research problems in this field is multimodal fusion. The fusion of multimodal data is the process of integrating multiple unimodal representations into one compact multimodal representation. Previous research in this field has exploited the expressiveness of tensors for multimodal representation. However, these methods often suffer from exponential increase in dimensions and in computational complexity introduced by transformation of input into tensor. In this paper, we propose the Low-rank Multimodal Fusion method, which performs multimodal fusion using low-rank tensors to improve efficiency. We evaluate our model on three different tasks: multimodal sentiment analysis, speaker trait analysis, and emotion recognition. Our model achieves competitive results on all these tasks while drastically reducing computational complexity. Additional experiments also show that our model can perform robustly for a wide range of low-rank settings, and is indeed much more efficient in both training and inference compared to other methods that utilize tensor representations.
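The abstract does not spell out the fusion formula, so the following is only a minimal NumPy sketch of the general idea, under stated assumptions: each modality vector has a constant 1 appended, the full fusion weight tensor is assumed to be a sum of `rank` outer products of modality-specific factors, and the names `append_one`, `low_rank_fusion`, and `full_tensor_fusion`, as well as all dimensions, are hypothetical. It checks that the factored computation matches the explicit tensor computation and compares parameter counts.

```python
import numpy as np

rng = np.random.default_rng(0)


def append_one(z):
    # Assumption: a constant 1 is appended to each unimodal vector.
    return np.concatenate([z, [1.0]])


def low_rank_fusion(inputs, factors):
    # factors[m] has shape (rank, d_m + 1, d_out): one factor per modality.
    fused = None
    for z, w in zip(inputs, factors):
        proj = np.einsum("i,rio->ro", append_one(z), w)  # (rank, d_out)
        # Elementwise product across modalities, kept separate per rank term.
        fused = proj if fused is None else fused * proj
    return fused.sum(axis=0)  # sum over the rank terms


def full_tensor_fusion(inputs, factors):
    # Reference path for exactly three modalities: build the full weight
    # tensor from the factors, then contract it with the outer product of
    # the inputs. Cost grows with the product of the input dimensions.
    a, v, l = (append_one(z) for z in inputs)
    wa, wv, wl = factors
    weight = np.einsum("rao,rbo,rco->abco", wa, wv, wl)
    return np.einsum("a,b,c,abco->o", a, v, l, weight)


# Hypothetical sizes chosen only for illustration.
dims, rank, d_out = (4, 3, 5), 2, 6
inputs = [rng.standard_normal(d) for d in dims]
factors = [rng.standard_normal((rank, d + 1, d_out)) for d in dims]

h_low = low_rank_fusion(inputs, factors)
h_full = full_tensor_fusion(inputs, factors)

# Parameter counts: product of dimensions versus sum of dimensions.
full_params = int(np.prod([d + 1 for d in dims])) * d_out
low_params = rank * sum(d + 1 for d in dims) * d_out
```

In this sketch the factored path never materializes the full weight tensor, which is where the efficiency gain described in the abstract would come from: its parameter count scales with the sum of the modality dimensions rather than their product.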
Pages: 2247-2256
Page count: 10