Image-Text Multimodal Sentiment Analysis Framework of Assamese News Articles Using Late Fusion

被引:9
|
作者
Das, Ringki [1 ]
Singh, Thoudam Doren [1 ]
机构
[1] Natl Inst Technol Silchar, Dept Comp Sci & Engn, Silchar 788010, Assam, India
关键词
Multimodal sentiment analysis; low resource language; caption generation; machine learning classifier; late fusion;
D O I
10.1145/3584861
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Before the arrival of the web as a corpus, people detected positive and negative news based on the understanding of the textual content from physical newspaper rather than an automatic identification approach from readily available e-newspapers. Thus, the earlier sentiment analysis approach is based on unimodal data, and less effort is paid to the multimodal data. However, the presence of multimodal information helps us to get a clearer understanding of the sentiment. To the best of our knowledge, less work has been introduced on the image-text multimodal sentiment analysis framework of Assamese, a low-resource Indian language mostly spoken in the northeast part of India. We built an Assamese news articles dataset consisting of news text and associated images and one image caption to conduct an experimental study. Focusing on important words and discriminative regions of the images mostly related to sentiment, two individual unimodal such as textual and visual models are proposed. The visual model is developed using an encoder-decoder-based image caption generation system. An image-text multimodal approach is proposed to explore the internal correlation between textual and visual features for joint sentiment classification. Finally, we propose the multimodal sentiment analysis framework, i.e., Textual Visual Multimodal Fusion, by employing a late fusion scheme to merge the three different modalities for the final sentiment prediction. Experimental results conducted on the Assamese dataset built in-house demonstrate that the contextual integration of multimodal features delivers better performance than unimodal features.
引用
收藏
页数:30
相关论文
共 50 条
  • [41] Multimodal modelling of human emotion using sound, image and text fusion
    Seyed Sadegh Hosseini
    Mohammad Reza Yamaghani
    Soodabeh Poorzaker Arabani
    [J]. Signal, Image and Video Processing, 2024, 18 : 71 - 79
  • [42] Multimodal modelling of human emotion using sound, image and text fusion
    Hosseini, Seyed Sadegh
    Yamaghani, Mohammad Reza
    Arabani, Soodabeh Poorzaker
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (01) : 71 - 79
  • [43] Multimodal Sentiment Analysis using Deep Learning Fusion Techniques and Transformers
    Bin Habib, Muhaimin
    Hafiz, Md. Ferdous Bin
    Khan, Niaz Ashraf
    Hossain, Sohrab
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (06) : 856 - 863
  • [44] A Precise Framework for Rice Leaf Disease Image-Text Retrieval Using FHTW-Net
    Zhou, Hongliang
    Hu, Yufan
    Liu, Shuai
    Zhou, Guoxiong
    Xu, Jiaxin
    Chen, Aibin
    Wang, Yanfeng
    Li, Liujun
    Hu, Yahui
    [J]. PLANT PHENOMICS, 2024, 7
  • [45] Explainable stock prices prediction from financial news articles using sentiment analysis
    Gite, Shilpa
    Khatavkar, Hrituja
    Kotecha, Ketan
    Srivastava, Shilpi
    Maheshwari, Priyam
    Pandey, Neerav
    [J]. PEERJ COMPUTER SCIENCE, 2021,
  • [46] Explainable stock prices prediction from financial news articles using sentiment analysis
    Gite, Shilpa
    Khatavkar, Hrituja
    Kotecha, Ketan
    Srivastava, Shilpi
    Maheshwari, Priyam
    Pandey, Neerav
    [J]. PeerJ Computer Science, 2021, 7 : 1 - 21
  • [47] Text-image semantic relevance identification for aspect-based multimodal sentiment analysis
    Zhang, Tianzhi
    Zhou, Gang
    Lu, Jicang
    Li, Zhibo
    Wu, Hao
    Liu, Shuo
    [J]. PEERJ COMPUTER SCIENCE, 2024, 10
  • [48] Coordinated-joint translation fusion framework with sentiment-interactive graph convolutional networks for multimodal sentiment analysis
    Lu, Qiang
    Sun, Xia
    Gao, Zhizezhang
    Long, Yunfei
    Feng, Jun
    Zhang, Hao
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (01)
  • [49] A Novel Framework Using Neutrosophy for Integrated Speech and Text Sentiment Analysis
    Mishra, Kritika
    Kandasamy, Ilanthenral
    Kandasamy W. B., Vasantha
    Smarandache, Florentin
    [J]. SYMMETRY-BASEL, 2020, 12 (10): : 1 - 22
  • [50] Sentiment Analysis of COVID-19 using Multimodal Fusion Neural Networks
    Ermatita, Ermatita
    Abdiansah, Abdiansah
    Rini, Dian Palupi
    Febry, Fatmalina
    [J]. TEM JOURNAL-TECHNOLOGY EDUCATION MANAGEMENT INFORMATICS, 2022, 11 (03): : 1316 - 1321