A Multi-step Attention and Multi-level Structure Network for Multimodal Sentiment Analysis

被引:0
|
作者
Zhang, Chuanlei [1 ]
Zhao, Hongwei [1 ]
Wang, Bo [2 ]
Wang, Wei [2 ]
Ke, Ting [1 ]
Li, Jianrong [1 ]
机构
[1] Tianjin Univ Sci & Technol, Sch Artificial Intelligence, Tianjin, Peoples R China
[2] Sitonholy Tianjin Technol Co Ltd, Tianjin, Peoples R China
关键词
Multimodal fusion; Sentiment analysis; Multi-step attention mechanism;
D O I
10.1007/978-3-031-17120-8_56
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal sentiment analysis aims to predict sentiment polarity from several modalities, which is an essential task for widespread applications. The core part of this task is to design a suitable fusion schema to integrate the heterogeneous information from different modalities. However, previous methods usually adopted simple interaction strategies, such as gate or attention mechanisms, which may lead to extracted features containing redundant information. In addition, most of them only focus on the interaction information between single modality, ignoring the modality pair's interaction information. In this paper, we propose a Multi-step Attention and Multi-level Structure network (MAMS) to address the above problems. Specifically, the multi-step attention mechanism extracts the critical information multiple times during the fusion process, which can reduce the interference of redundant information. Furthermore, the multi-level structure can capture both single modality's and modality pair's interaction information. Experimental results on two datasets (CMU-MOSI and CMU-MOSEI) demonstrate the superiority and effectiveness of our proposed MAMS model.
引用
下载
收藏
页码:723 / 735
页数:13
相关论文
共 50 条
  • [21] Multi-Level Multimodal Transformer Network for Multimodal Recipe Comprehension
    Liu, Ao
    Yuan, Shuai
    Zhang, Chenbin
    Luo, Congjian
    Liao, Yaqing
    Bai, Kun
    Xu, Zenglin
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1781 - 1784
  • [22] Multi-level textual-visual alignment and fusion network for multimodal aspect-based sentiment analysis
    Li, You
    Ding, Han
    Lin, Yuming
    Feng, Xinyu
    Chang, Liang
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (04)
  • [23] Multi-level textual-visual alignment and fusion network for multimodal aspect-based sentiment analysis
    You Li
    Han Ding
    Yuming Lin
    Xinyu Feng
    Liang Chang
    Artificial Intelligence Review, 57
  • [24] Multi-level feature optimization and multimodal contextual fusion for sentiment analysis and emotion classification
    Huddar, Mahesh G.
    Sannakki, Sanjeev S.
    Rajpurohit, Vijay S.
    COMPUTATIONAL INTELLIGENCE, 2020, 36 (02) : 861 - 881
  • [25] Multi-Attention Network for Sentiment Analysis
    Du, Tingting
    Huang, Yunyin
    Wu, Xian
    Chang, Huiyou
    PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL (NLPIR 2018), 2018, : 49 - 54
  • [26] Spatiotemporal Graph Attention Network modeling for multi-step passenger demand prediction at multi-zone level
    Dong, Chengxiang
    Zhang, Kunpeng
    Wei, Xin
    Wang, Yongchao
    Yang, Yuhui
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2022, 603
  • [27] Multi-level fusion with deep neural networks for multimodal sentiment classification
    Zhang Guangwei
    Zhao Bing
    Li Ruifan
    The Journal of China Universities of Posts and Telecommunications, 2022, 29 (03) : 25 - 33
  • [28] Multi-level Attention Fusion for Multimodal Driving Maneuver Recognition
    Liu, Jing
    Liu, Yang
    Tian, Chengwen
    Zhao, Mengyang
    Zeng, Xinhua
    Song, Liang
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 2609 - 2613
  • [29] doepipeline: a systematic approach to optimizing multi-level and multi-step data processing workflows
    Daniel Svensson
    Rickard Sjögren
    David Sundell
    Andreas Sjödin
    Johan Trygg
    BMC Bioinformatics, 20
  • [30] doepipeline: a systematic approach to optimizing multi-level and multi-step data processing workflows
    Svensson, Daniel
    Sjogren, Rickard
    Sundell, David
    Sjodin, Andreas
    Trygg, Johan
    BMC BIOINFORMATICS, 2019, 20 (01)