TEDT: Transformer-Based Encoding–Decoding Translation Network for Multimodal Sentiment Analysis

被引:0
|
作者
Fan Wang
Shengwei Tian
Long Yu
Jing Liu
Junwen Wang
Kun Li
Yongtao Wang
机构
[1] University of Xinjiang,School of Software
[2] University of Xinjiang,Network and Information Center
来源
Cognitive Computation | 2023年 / 15卷
关键词
Multimodal sentiment analysis; Transformer; Multimodal fusion; Multimodal attention;
D O I
暂无
中图分类号
学科分类号
摘要
Multimodal sentiment analysis is a popular and challenging research topic in natural language processing, but the impact of individual modal data in videos on sentiment analysis results can be different. In the temporal dimension, natural language sentiment is influenced by nonnatural language sentiment, which may enhance or weaken the original sentiment of the current natural language. In addition, there is a general problem of poor quality of nonnatural language features, which essentially hinders the effect of multimodal fusion. To address the above issues, we proposed a multimodal encoding–decoding translation network with a transformer and adopted a joint encoding–decoding method with text as the primary information and sound and image as the secondary information. To reduce the negative impact of nonnatural language data on natural language data, we propose a modality reinforcement cross-attention module to convert nonnatural language features into natural language features to improve their quality and better integrate multimodal features. Moreover, the dynamic filtering mechanism filters out the error information generated in the cross-modal interaction to further improve the final output. We evaluated the proposed method on two multimodal sentiment analysis benchmark datasets (MOSI and MOSEI), and the accuracy of the method was 89.3% and 85.9%, respectively. In addition, our method outperformed the current state-of-the-art methods. Our model can greatly improve the effect of multimodal fusion and more accurately analyze human sentiment.
引用
收藏
页码:289 / 303
页数:14
相关论文
共 50 条
  • [31] Disentangled Multimodal Brain MR Image Translation via Transformer-based Modality Infuser
    Cho, Jihoon
    Liu, Xiaofeng
    Xing, Fangxu
    Ouyang, Jinsong
    El Fakhri, Georges
    Park, Jinah
    Woo, Jonghye
    MEDICAL IMAGING 2024: IMAGE PROCESSING, 2024, 12926
  • [32] SentimentFormer: A Transformer-Based Multimodal Fusion Framework for Enhanced Sentiment Analysis of Memes in Under-Resourced Bangla Language
    Faria, Fatema Tuj Johora
    Baniata, Laith H.
    Baniata, Mohammad H.
    Khair, Mohannad A.
    Bani Ata, Ahmed Ibrahim
    Bunterngchit, Chayut
    Kang, Sangwoo
    ELECTRONICS, 2025, 14 (04):
  • [33] An emotion-driven, transformer-based network for multimodal fake news detection
    Yadav, Ashima
    Gupta, Anika
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2024, 13 (01)
  • [34] Transformer-based models for multimodal irony detection
    Tomás D.
    Ortega-Bueno R.
    Zhang G.
    Rosso P.
    Schifanella R.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (6) : 7399 - 7410
  • [35] Transformer-Based Multimodal Infusion Dialogue Systems
    Liu, Bo
    He, Lejian
    Liu, Yafei
    Yu, Tianyao
    Xiang, Yuejia
    Zhu, Li
    Ruan, Weijian
    ELECTRONICS, 2022, 11 (20)
  • [36] An emotion-driven, transformer-based network for multimodal fake news detection
    Ashima Yadav
    Anika Gupta
    International Journal of Multimedia Information Retrieval, 2024, 13
  • [37] TRANSFORMER-BASED HIERARCHICAL CLUSTERING FOR BRAIN NETWORK ANALYSIS
    Dai, Wei
    Cui, Hejie
    Kan, Xuan
    Guo, Ying
    Van Rooij, Sanne
    Yang, Carl
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [38] Sentiment-Oriented Transformer-Based Variational Autoencoder Network for Live Video Commenting
    Fu, Fengyi
    Fang, Shancheng
    Chen, Weidong
    Mao, Zhendong
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (04)
  • [39] Transformer-Based Interactive Multi-Modal Attention Network for Video Sentiment Detection
    Zhuang, Xuqiang
    Liu, Fangai
    Hou, Jian
    Hao, Jianhua
    Cai, Xiaohong
    NEURAL PROCESSING LETTERS, 2022, 54 (03) : 1943 - 1960
  • [40] TensorFormer: A Tensor-Based Multimodal Transformer for Multimodal Sentiment Analysis and Depression Detection
    Sun, Hao
    Chen, Yen-Wei
    Lin, Lanfen
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (04) : 2776 - 2786