Non-Uniform Attention Network for Multi-modal Sentiment Analysis

被引:10
|
作者
Wang, Binqiang [1 ,2 ,3 ]
Dong, Gang [1 ,2 ,3 ]
Zhao, Yaqian [1 ,2 ,3 ]
Li, Rengang [1 ,2 ,3 ]
Cao, Qichun [1 ,2 ,3 ]
Chao, Yinyin [1 ,2 ,3 ]
机构
[1] Inspur Beijing Elect Informat Ind Co Ltd, Beijing, Peoples R China
[2] Inspur Elect Informat Ind Co Ltd, Jinan, Peoples R China
[3] Shandong Mass Informat Technol Res Inst, Jinan, Peoples R China
来源
关键词
Multi-modal information fusion; Video sentiment analysis; Attention mechanism; EMOTION RECOGNITION; FUSION;
D O I
10.1007/978-3-030-98358-1_48
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Remarkable success has been achieved in the multi-modal sentiment analysis community thanks to the existence of annotated multi-modal data sets. However, coming from three different modalities, text, sound, and vision, establishes significant barriers for better feature fusion. In this paper, we introduce "NUAN" , a non-uniform attention network for multi-modal feature fusion. NUAN is designed based on attention mechanism via considering three modalities simultaneously, but not uniformly: the text is seen as a determinate representation, with the hope that by leveraging the acoustic and visual representation, we are able to inject the effective information into a solid representation, named as tripartite interaction representation. A novel non-uniform attention module is inserted into adjacent time steps in LSTM (Long Shot-Term Memory) and processes information recurrently. The final outputs of LSTM and NUAM are concatenated to a vector, which is imported into a linear embedding layer to output the sentiment analysis result. The experimental analysis of two databases demonstrates the effectiveness of the proposed method.
引用
收藏
页码:612 / 623
页数:12
相关论文
共 50 条
  • [1] Non-Uniform Attention Network for Multi-modal Sentiment Analysis
    Wang, Binqiang
    Dong, Gang
    Zhao, Yaqian
    Li, Rengang
    Cao, Qichun
    Chao, Yinyin
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, 13141 LNCS : 612 - 623
  • [2] Multi-modal fusion attention sentiment analysis for mixed sentiment classification
    Xue, Zhuanglin
    Xu, Jiabin
    [J]. COGNITIVE COMPUTATION AND SYSTEMS, 2024,
  • [3] Contextual Inter-modal Attention for Multi-modal Sentiment Analysis
    Ghosal, Deepanway
    Akhtar, Md Shad
    Chauhan, Dushyant
    Poria, Soujanya
    Ekbalt, Asif
    Bhattacharyyat, Pushpak
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3454 - 3466
  • [4] Mixture of Attention Variants for Modal Fusion in Multi-Modal Sentiment Analysis
    He, Chao
    Zhang, Xinghua
    Song, Dongqing
    Shen, Yingshan
    Mao, Chengjie
    Wen, Huosheng
    Zhu, Dingju
    Cai, Lihua
    [J]. BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (02)
  • [5] Multi-Modal Sentiment Analysis Based on Interactive Attention Mechanism
    Wu, Jun
    Zhu, Tianliang
    Zheng, Xinli
    Wang, Chunzhi
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (16):
  • [6] MIA-Net: Multi-Modal Interactive Attention Network for Multi-Modal Affective Analysis
    Li, Shuzhen
    Zhang, Tong
    Chen, Bianna
    Chen, C. L. Philip
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (04) : 2796 - 2809
  • [7] Transformer-Based Interactive Multi-Modal Attention Network for Video Sentiment Detection
    Zhuang, Xuqiang
    Liu, Fangai
    Hou, Jian
    Hao, Jianhua
    Cai, Xiaohong
    [J]. NEURAL PROCESSING LETTERS, 2022, 54 (03) : 1943 - 1960
  • [8] Context-aware Interactive Attention for Multi-modal Sentiment and Emotion Analysis
    Chauhan, Dushyant Singh
    Akhtar, Md Shad
    Ekbal, Asif
    Bhattacharyya, Pushpak
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 5647 - 5657
  • [9] Transformer-Based Interactive Multi-Modal Attention Network for Video Sentiment Detection
    Xuqiang Zhuang
    Fangai Liu
    Jian Hou
    Jianhua Hao
    Xiaohong Cai
    [J]. Neural Processing Letters, 2022, 54 : 1943 - 1960
  • [10] Attention Network for Non-Uniform Deblurring
    Qi, Qing
    Guo, Jichang
    Jin, Weipei
    [J]. IEEE ACCESS, 2020, 8 (08): : 100044 - 100057