Image-Text Multimodal Sentiment Analysis Framework of Assamese News Articles Using Late Fusion

被引:9
|
作者
Das, Ringki [1 ]
Singh, Thoudam Doren [1 ]
机构
[1] Natl Inst Technol Silchar, Dept Comp Sci & Engn, Silchar 788010, Assam, India
关键词
Multimodal sentiment analysis; low resource language; caption generation; machine learning classifier; late fusion;
D O I
10.1145/3584861
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Before the arrival of the web as a corpus, people detected positive and negative news based on the understanding of the textual content from physical newspaper rather than an automatic identification approach from readily available e-newspapers. Thus, the earlier sentiment analysis approach is based on unimodal data, and less effort is paid to the multimodal data. However, the presence of multimodal information helps us to get a clearer understanding of the sentiment. To the best of our knowledge, less work has been introduced on the image-text multimodal sentiment analysis framework of Assamese, a low-resource Indian language mostly spoken in the northeast part of India. We built an Assamese news articles dataset consisting of news text and associated images and one image caption to conduct an experimental study. Focusing on important words and discriminative regions of the images mostly related to sentiment, two individual unimodal such as textual and visual models are proposed. The visual model is developed using an encoder-decoder-based image caption generation system. An image-text multimodal approach is proposed to explore the internal correlation between textual and visual features for joint sentiment classification. Finally, we propose the multimodal sentiment analysis framework, i.e., Textual Visual Multimodal Fusion, by employing a late fusion scheme to merge the three different modalities for the final sentiment prediction. Experimental results conducted on the Assamese dataset built in-house demonstrate that the contextual integration of multimodal features delivers better performance than unimodal features.
引用
收藏
页数:30
相关论文
共 50 条
  • [1] Image-text sentiment analysis via deep multimodal attentive fusion
    Huang, Feiran
    Zhang, Xiaoming
    Zhao, Zhonghua
    Xu, Jie
    Li, Zhoujun
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 167 : 26 - 37
  • [2] Multimodal Sentiment Analysis With Image-Text Interaction Network
    Zhu, Tong
    Li, Leida
    Yang, Jufeng
    Zhao, Sicheng
    Liu, Hantao
    Qian, Jiansheng
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3375 - 3385
  • [3] Multimodal Sentiment Analysis With Image-Text Correlation Modal
    Li, Yuxin
    Jiang, Shan
    Chaomurilige
    [J]. 2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS, 2024, : 281 - 286
  • [4] Understanding image-text relations and news values for multimodal news analysis
    Cheema, Gullal S.
    Hakimov, Sherzod
    Mueller-Budack, Eric
    Otto, Christian
    Bateman, John A.
    Ewerth, Ralph
    [J]. FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2023, 6
  • [5] Multimodal Fake News Analysis Based on Image-Text Similarity
    Zhang, Xichen
    Dadkhah, Sajjad
    Weismann, Alexander Gerald
    Kanaani, Mohammad Amin
    Ghorbani, Ali A.
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (01) : 959 - 972
  • [6] Image-Text Fusion Sentiment Analysis Method Based on Image Semantic Translation
    Huang, Jian
    Wang, Ying
    [J]. Computer Engineering and Applications, 2023, 59 (11) : 180 - 187
  • [7] Image-text interaction graph neural network for image-text sentiment analysis
    Wenxiong Liao
    Bi Zeng
    Jianqi Liu
    Pengfei Wei
    Jiongkun Fang
    [J]. Applied Intelligence, 2022, 52 : 11184 - 11198
  • [8] Image-text interaction graph neural network for image-text sentiment analysis
    Liao, Wenxiong
    Zeng, Bi
    Liu, Jianqi
    Wei, Pengfei
    Fang, Jiongkun
    [J]. APPLIED INTELLIGENCE, 2022, 52 (10) : 11184 - 11198
  • [9] An image-text consistency driven multimodal sentiment analysis approach for social media
    Zhao, Ziyuan
    Zhu, Huiying
    Xue, Zehao
    Liu, Zhao
    Tian, Jing
    Chua, Matthew Chin Heng
    Liu, Maofu
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (06)
  • [10] A TBGAV-Based Image-Text Multimodal Sentiment Analysis Method for Tourism Reviews
    Zhang, Ke
    Wang, Shunmin
    Yu, Yuanyu
    [J]. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY AND WEB ENGINEERING, 2023, 18 (01) : 1 - 17