Image-Text Multimodal Sentiment Analysis Framework of Assamese News Articles Using Late Fusion

被引:9
|
作者
Das, Ringki [1 ]
Singh, Thoudam Doren [1 ]
机构
[1] Natl Inst Technol Silchar, Dept Comp Sci & Engn, Silchar 788010, Assam, India
关键词
Multimodal sentiment analysis; low resource language; caption generation; machine learning classifier; late fusion;
D O I
10.1145/3584861
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Before the arrival of the web as a corpus, people detected positive and negative news based on the understanding of the textual content from physical newspaper rather than an automatic identification approach from readily available e-newspapers. Thus, the earlier sentiment analysis approach is based on unimodal data, and less effort is paid to the multimodal data. However, the presence of multimodal information helps us to get a clearer understanding of the sentiment. To the best of our knowledge, less work has been introduced on the image-text multimodal sentiment analysis framework of Assamese, a low-resource Indian language mostly spoken in the northeast part of India. We built an Assamese news articles dataset consisting of news text and associated images and one image caption to conduct an experimental study. Focusing on important words and discriminative regions of the images mostly related to sentiment, two individual unimodal such as textual and visual models are proposed. The visual model is developed using an encoder-decoder-based image caption generation system. An image-text multimodal approach is proposed to explore the internal correlation between textual and visual features for joint sentiment classification. Finally, we propose the multimodal sentiment analysis framework, i.e., Textual Visual Multimodal Fusion, by employing a late fusion scheme to merge the three different modalities for the final sentiment prediction. Experimental results conducted on the Assamese dataset built in-house demonstrate that the contextual integration of multimodal features delivers better performance than unimodal features.
引用
收藏
页数:30
相关论文
共 50 条
  • [21] Collaborative fine-grained interaction learning for image-text sentiment analysis
    Xiao, Xingwang
    Pu, Yuanyuan
    Zhou, Dongming
    Cao, Jinde
    Gu, Jinjing
    Zhao, Zhengpeng
    Xu, Dan
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 279
  • [22] Topic Modeling, Sentiment Analysis and Text Summarization for Analyzing News Headlines and Articles
    Thakur, Omswroop
    Saritha, Sri Khetwat
    Jain, Sweta
    [J]. MACHINE LEARNING, IMAGE PROCESSING, NETWORK SECURITY AND DATA SCIENCES, MIND 2022, PT I, 2022, 1762 : 220 - 239
  • [23] Multi-token Fusion Framework for Multimodal Sentiment Analysis
    Long, Zhihui
    Deng, Huan
    Yang, Zhenguo
    Liu, Wenyin
    [J]. WEB AND BIG DATA, PT II, APWEB-WAIM 2023, 2024, 14332 : 424 - 438
  • [24] TETFN: A text enhanced transformer fusion network for multimodal sentiment analysis
    Wang, Di
    Guo, Xutong
    Tian, Yumin
    Liu, Jinhui
    He, LiHuo
    Luo, Xuemei
    [J]. PATTERN RECOGNITION, 2023, 136
  • [25] Sentiment analysis based on text information enhancement and multimodal feature fusion
    Liu, Zijun
    Cai, Li
    Yang, Wenjie
    Liu, Junhui
    [J]. PATTERN RECOGNITION, 2024, 156
  • [26] Detection of citrus diseases in complex backgrounds based on image-text multimodal fusion and knowledge assistance
    Qiu, Xia
    Chen, Hongwen
    Huang, Ping
    Zhong, Dan
    Guo, Tao
    Pu, Changbin
    Li, Zongnan
    Liu, Yongling
    Chen, Jin
    Wang, Si
    [J]. FRONTIERS IN PLANT SCIENCE, 2023, 14
  • [27] Image-Text Out-Of-Context Detection Using Synthetic Multimodal Misinformation
    Shalabi, Fatma
    Nguyen, Huy H.
    Felouat, Hichem
    Chang, Ching-Chun
    Echizen, Isao
    [J]. 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 605 - 612
  • [28] Attention-Based Modality-Gated Networks for Image-Text Sentiment Analysis
    Huang, Feiran
    Wei, Kaimin
    Weng, Jian
    Li, Zhoujun
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 16 (03)
  • [29] Cross-Modal Sentiment Analysis Based on CLIP Image-Text Attention Interaction
    Lu, Xintao
    Ni, Yonglong
    Ding, Zuohua
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (02) : 895 - 903
  • [30] Sentiment Analysis of Sindhi News Articles using Deep Learning
    Barakzai, Fahama
    Bhatti, Sania
    Saddar, Salahuddin
    [J]. 2022 IEEE 17TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND INFORMATION TECHNOLOGIES (CSIT), 2022, : 26 - 31