CubeMLP: A MLP-based Model for Multimodal Sentiment Analysis and Depression Estimation

被引:29
|
作者
Sun, Hao [1 ]
Wang, Hongyi [1 ]
Liu, Jiaqing [2 ]
Chen, Yen-Wei [2 ]
Lin, Lanfen [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China
[2] Ritsumeikan Univ, Coll Informat Sci & Engn, Kusatsu, Shiga, Japan
关键词
multimodal processing; multimodal fusion; multimodal interaction; multimedia; MLP; sentiment analysis; depression detection;
D O I
10.1145/3503161.3548025
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Multimodal sentiment analysis and depression estimation are two important research topics that aim to predict human mental states using multimodal data. Previous research has focused on developing effective fusion strategies for exchanging and integrating mind-related information from different modalities. Some MLP-based techniques have recently achieved considerable success in a variety of computer vision tasks. Inspired by this, we explore multimodal approaches with a feature-mixing perspective in this study. To this end, we introduce CubeMLP, a multimodal feature processing framework based entirely on MLP. CubeMLP consists of three independent MLP units, each of which has two affine transformations. CubeMLP accepts all relevant modality features as input and mixes them across three axes. After extracting the characteristics using CubeMLP, the mixed multimodal features are flattened for task predictions. Our experiments are conducted on sentiment analysis datasets: CMU-MOSI and CMU-MOSEI, and depression estimation dataset: AVEC2019. The results show that CubeMLP can achieve state-of-the-art performance with a much lower computing cost.
引用
收藏
页码:3722 / 3729
页数:8
相关论文
共 50 条
  • [21] Multimodal model for the Spanish sentiment analysis in a tourism domain
    Monsalve-Pulido, Julian
    Parra, Carlos Alberto
    Aguilar, Jose
    SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)
  • [22] ModWaveMLP: MLP-Based Mode Decomposition andWavelet Denoising Model to Defeat Complex Structures in Traffic Forecasting
    Sun, Ke
    Liu, Pei
    Li, Pengfei
    Liao, Zhifang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 8, 2024, : 9035 - 9043
  • [23] BiSMSM: A Hybrid MLP-Based Model of Global Self-Attention Processes for EEG-Based Emotion Recognition
    Li, Wei
    Tian, Ye
    Hou, Bowen
    Dong, Jianzhang
    Shao, Shitong
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 37 - 48
  • [24] The Optimal MLP-based Model for Displacement Field Measurement in 2D Images and Its Application Perspective
    Mangileva, Daria
    2024 2ND ASIA CONFERENCE ON COMPUTER VISION, IMAGE PROCESSING AND PATTERN RECOGNITION, CVIPPR 2024, 2024,
  • [25] A MLP Based PD Estimation Model For SME Credits
    Derelioglu, Guelnur
    Gurgen, Fikret
    2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 264 - 267
  • [26] Prediction of landslide tsunami run-up on a plane beach through feature selected MLP-based model
    Aydin, Baran
    Yaguzluk, Sava
    Acikkar, Mustafa
    JOURNAL OF OCEAN ENGINEERING AND SCIENCE, 2024, 9 (03) : 222 - 231
  • [27] ConMLP: MLP-Based Self-Supervised Contrastive Learning for Skeleton Data Analysis and Action Recognition
    Dai, Chuan
    Wei, Yajuan
    Xu, Zhijie
    Chen, Minsi
    Liu, Ying
    Fan, Jiulun
    SENSORS, 2023, 23 (05)
  • [28] A Multimodal Sentiment Analysis Model for Graphic Texts Based on Deep Feature Interaction Networks
    Chang, Wanjun
    Zhang, Dongfang
    International Journal of Ambient Computing and Intelligence, 2024, 15 (01)
  • [29] GATED MECHANISM FOR ATTENTION BASED MULTIMODAL SENTIMENT ANALYSIS
    Kumar, Ayush
    Vepa, Jithendra
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4477 - 4481
  • [30] Multimodal sentiment analysis based on fusion methods: A survey
    Zhu, Linan
    Zhu, Zhechao
    Zhang, Chenwei
    Xu, Yifei
    Kong, Xiangjie
    INFORMATION FUSION, 2023, 95 : 306 - 325