CubeMLP: A MLP-based Model for Multimodal Sentiment Analysis and Depression Estimation

Cited by: 29
Authors
Sun, Hao [1]
Wang, Hongyi [1]
Liu, Jiaqing [2]
Chen, Yen-Wei [2]
Lin, Lanfen [1]
Affiliations
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China
[2] Ritsumeikan Univ, Coll Informat Sci & Engn, Kusatsu, Shiga, Japan
Keywords
multimodal processing; multimodal fusion; multimodal interaction; multimedia; MLP; sentiment analysis; depression detection
DOI
10.1145/3503161.3548025
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Multimodal sentiment analysis and depression estimation are two important research topics that aim to predict human mental states using multimodal data. Previous research has focused on developing effective fusion strategies for exchanging and integrating mind-related information from different modalities. Some MLP-based techniques have recently achieved considerable success in a variety of computer vision tasks. Inspired by this, we explore multimodal approaches with a feature-mixing perspective in this study. To this end, we introduce CubeMLP, a multimodal feature processing framework based entirely on MLP. CubeMLP consists of three independent MLP units, each of which has two affine transformations. CubeMLP accepts all relevant modality features as input and mixes them across three axes. After extracting the characteristics using CubeMLP, the mixed multimodal features are flattened for task predictions. Our experiments are conducted on sentiment analysis datasets: CMU-MOSI and CMU-MOSEI, and depression estimation dataset: AVEC2019. The results show that CubeMLP can achieve state-of-the-art performance with a much lower computing cost.
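The mixing scheme described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The axis layout (sequence, modality, channel), the ReLU between the two affine transformations, the hidden width, and the names `mlp_unit`, `init_unit`, and `cube_mix` are all assumptions made for illustration. Normalization, residual connections, and the prediction head are omitted because the abstract does not specify them.

```python
import numpy as np

def mlp_unit(x, w1, b1, w2, b2, axis):
    """Mix features along one axis using two affine transformations."""
    x = np.moveaxis(x, axis, -1)        # bring the axis to be mixed to the end
    h = np.maximum(x @ w1 + b1, 0.0)    # first affine map, then ReLU (assumed activation)
    y = h @ w2 + b2                     # second affine map, back to the original size
    return np.moveaxis(y, -1, axis)     # restore the original axis order

def init_unit(rng, dim, hidden):
    """Random parameters for one unit (hypothetical initialization)."""
    return (rng.normal(0.0, 0.1, (dim, hidden)), np.zeros(hidden),
            rng.normal(0.0, 0.1, (hidden, dim)), np.zeros(dim))

def cube_mix(x, params):
    """Apply one MLP unit per axis in turn, then flatten the mixed features."""
    for axis, (w1, b1, w2, b2) in enumerate(params):
        x = mlp_unit(x, w1, b1, w2, b2, axis)
    return x.reshape(-1)

# Assumed layout: (sequence length, number of modalities, channels).
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 3, 8))
params = [init_unit(rng, dim, hidden=16) for dim in x.shape]
out = cube_mix(x, params)               # flattened vector of length 4 * 3 * 8
```

Under these assumptions each unit touches only one axis, so the parameter count grows with the axis sizes rather than with pairwise attention over all tokens, which is consistent with the abstract's lower-computing-cost claim.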
Pages: 3722-3729
Page count: 8
Related papers
50 in total
  • [1] An MLP-based model for identifying qEEG in depression
    Mitra, S
    Sarbadhikari, SN
    Pal, SK
    INTERNATIONAL JOURNAL OF BIO-MEDICAL COMPUTING, 1996, 43 (03): : 179 - 187
  • [2] An MLP-based model for identifying qEEG in depression
    Machine Intelligence Unit, Indian Statistical Institute, Calcutta 700035, India
    INT. J. BIO-MED. COMPUT., 3 (179-187):
  • [3] MLP-Based Model for Estimation of Methane Seam Pressure
    Skiba, Marta
    Dutka, Barbara
    Mlynarczuk, Mariusz
    ENERGIES, 2021, 14 (22)
  • [4] MLP-Based Regression Prediction Model For Compound Bioactivity
    Qin, Yongfei
    Li, Chao
    Shi, Xia
    Wang, Weigang
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 10
  • [5] MLP-BASED FACTOR ANALYSIS FOR TANDEM SPEECH RECOGNITION
    Ferras, Marc
    Bourlard, Herve
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6719 - 6723
  • [6] Analysis of MLP-Based Hierarchical Phoneme Posterior Probability Estimator
    Pinto, Joel
    Garimella, Sivaram
    Magimai-Doss, Mathew
    Hermansky, Hynek
    Bourlard, Herve
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (02): : 225 - 241
  • [7] Disturbance Magnitude Estimation: MLP-based Fusion Approach for Bulk Power Systems
    Zeng, Chujie
    Qiu, Wei
    Wang, Weikang
    Sun, Kaiqi
    Chen, Chang
    Sundaresh, Lakshmi
    Liu, Yilu
    2022 IEEE/IAS 58TH INDUSTRIAL AND COMMERCIAL POWER SYSTEMS TECHNICAL CONFERENCE (I&CPS), 2022,
  • [8] TensorFormer: A Tensor-Based Multimodal Transformer for Multimodal Sentiment Analysis and Depression Detection
    Sun, Hao
    Chen, Yen-Wei
    Lin, Lanfen
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (04) : 2776 - 2786
  • [9] A Hierarchical Three-Dimensional MLP-Based Model for EEG Emotion Recognition
    Li, Wei
    Tian, Ye
    Dong, Jianzhang
    Fang, Cheng
    IEEE SENSORS LETTERS, 2023, 7 (10)
  • [10] MLP-based multimodal tomato detection in complex scenarios: Insights from task-specific analysis of feature fusion architectures
    Chen, Wenjun
    Rao, Yuan
    Wang, Fengyi
    Zhang, Yu
    Wang, Tan
    Jin, Xiu
    Hou, Wenhui
    Jiang, Zhaohui
    Zhang, Wu
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 221