CubeMLP: A MLP-based Model for Multimodal Sentiment Analysis and Depression Estimation

被引:29
|
作者
Sun, Hao [1 ]
Wang, Hongyi [1 ]
Liu, Jiaqing [2 ]
Chen, Yen-Wei [2 ]
Lin, Lanfen [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China
[2] Ritsumeikan Univ, Coll Informat Sci & Engn, Kusatsu, Shiga, Japan
关键词
multimodal processing; multimodal fusion; multimodal interaction; multimedia; MLP; sentiment analysis; depression detection;
D O I
10.1145/3503161.3548025
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Multimodal sentiment analysis and depression estimation are two important research topics that aim to predict human mental states using multimodal data. Previous research has focused on developing effective fusion strategies for exchanging and integrating mind-related information from different modalities. Some MLP-based techniques have recently achieved considerable success in a variety of computer vision tasks. Inspired by this, we explore multimodal approaches with a feature-mixing perspective in this study. To this end, we introduce CubeMLP, a multimodal feature processing framework based entirely on MLP. CubeMLP consists of three independent MLP units, each of which has two affine transformations. CubeMLP accepts all relevant modality features as input and mixes them across three axes. After extracting the characteristics using CubeMLP, the mixed multimodal features are flattened for task predictions. Our experiments are conducted on sentiment analysis datasets: CMU-MOSI and CMU-MOSEI, and depression estimation dataset: AVEC2019. The results show that CubeMLP can achieve state-of-the-art performance with a much lower computing cost.
引用
收藏
页码:3722 / 3729
页数:8
相关论文
共 50 条
  • [41] ANALYSIS OF PHONE POSTERIOR FEATURE SPACE EXPLOITING CLASS-SPECIFIC SPARSITY AND MLP-BASED SIMILARITY MEASURE
    Asaei, Afsaneh
    Picart, Benjamin
    Bourlard, Herve
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4886 - 4889
  • [42] Performance analysis of MLP-based radar detectors in Weibull-distributed clutter with respect to target doppler frequency
    Vicen-Bueno, Raul
    Jarabo-Amores, Maria P.
    Rosa-Zurera, Manuel
    Gil-Pita, Roberto
    Mata-Moya, David
    ARTIFICIAL NEURAL NETWORKS - ICANN 2007, PT 2, PROCEEDINGS, 2007, 4669 : 690 - +
  • [43] Multimodal Sentiment Analysis Based on Interactive Transformer and Soft Mapping
    Li, Zuhe
    Guo, Qingbing
    Feng, Chengyao
    Deng, Lujuan
    Zhang, Qiuwen
    Zhang, Jianwei
    Wang, Fengqin
    Sun, Qian
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [44] An AutoML-based Approach to Multimodal Image Sentiment Analysis
    Lopes, Vasco
    Gaspar, Antonio
    Alexandre, Luis A.
    Cordeiro, Joao
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [45] Multimodal Sentiment Analysis Based on Bidirectional Mask Attention Mechanism
    Zhang Y.
    Zhang H.
    Liu Y.
    Liang K.
    Wang Y.
    Data Analysis and Knowledge Discovery, 2023, 7 (04) : 46 - 55
  • [46] Multimodal sentiment analysis based on improved correlation representation network
    Yaermaimaiti, Yilihamu
    Yan, Tianxing
    Zhuang, Guohang
    Kari, Tusongjiang
    International Journal of Communication Networks and Distributed Systems, 2024, 30 (06) : 679 - 698
  • [47] Convolutional MKL Based Multimodal Emotion Recognition and Sentiment Analysis
    Poria, Soujanya
    Chaturvedi, Iti
    Cambria, Erik
    Hussain, Amir
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 439 - 448
  • [48] Few-shot Multimodal Sentiment Analysis Based on Multimodal Probabilistic Fusion Prompts
    Yang, Xiaocui
    Feng, Shi
    Wang, Daling
    Zhang, Yifei
    Poria, Soujanya
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 6045 - 6053
  • [49] Multimodal Consistency-Based Teacher for Semi-Supervised Multimodal Sentiment Analysis
    Yuan, Ziqi
    Fang, Jingliang
    Xu, Hua
    Gao, Kai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 3669 - 3683
  • [50] Research on Multimodal Aspect-Based Sentiment Analysis Based on Image Caption and Multimodal Aspect Extraction
    Huang, Peng
    Tao, Jun
    Su, Tengrong
    Zhang, Xiaoqing
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 5415 - 5418