Multimodal model for the Spanish sentiment analysis in a tourism domain

被引:0
|
作者
Monsalve-Pulido, Julian [1 ]
Parra, Carlos Alberto [2 ]
Aguilar, Jose [3 ,4 ,5 ]
机构
[1] Univ Pedag & Tecnol Colombia, GIMI, Tunja, Colombia
[2] Pontificia Univ Javeriana, Bogota, Colombia
[3] Univ Los Andes, CEMISID, Merida, Venezuela
[4] Univ EAFIT, CIDITIC, Medellin, Colombia
[5] IMDEA Networks Inst, Madrid, Spain
关键词
Multimodal model; Sentiment analysis; Opinion mining; Spanish language; Tourism;
D O I
10.1007/s13278-024-01202-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problem of sentiment analysis of tourism data focuses on the analysis of the multimodal characteristics of the data generated digitally by tourists on each platform or social network. Generally, their opinions have multimodal characteristics, since they combine text, images or numbers (ratings), which represents an important challenge in sentiment analysis that requires new models or multimodal data classification techniques. This work proposes a multimodal sentiment analysis model for data in Spanish in the tourism domain composed of four main phases (extraction, classification, fusion, visualization), and a transversal phase to evaluate the quality of the multimodal sentiment analysis process. Thus, the multimodal sentiment analysis model integrates a data quality model to improve multimodal sentiment analysis tasks, but in addition, the linguistic resource "SenticNet 5" is adapted to Spanish. The model was validated by applying various classification metrics, and the classification results were compared to a manually labeled dataset (TASS) using two machine learning classification algorithms. The first was Random Forest, where the manually labeled dataset has a 50% F1 score compared to the adapted SenticNet automatically generated dataset, which has a 71% F1 score measure and a 70% accuracy. The classification generated by SenticNet is 21% higher than that of the TASS data set. The second algorithm applied was Support Vector Machine (SVM), which classified the SenticNet-generated dataset with an F1 score of 72% versus the manually created dataset with 57.7% (14.3% more effective). In the fusion tests of the multimodal sentiment inputs, the accuracy results for text were 65%, for images 33%, and the fusion of both was 71%. In general, it was identified that the opinions made by users composed of text in Spanish and images improve polarity identification if an independent classification is carried out, and then apply a polarity fusion process.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] DWWP: Domain-specific new words detection and word propagation system for sentiment analysis in the tourism domain
    Li, Wei
    Guo, Kun
    Shi, Yong
    Zhu, Luyao
    Zheng, Yuanchun
    KNOWLEDGE-BASED SYSTEMS, 2018, 146 : 203 - 214
  • [32] Trustworthy Multimodal Fusion for Sentiment Analysis in Ordinal Sentiment Space
    Xie, Zhuyang
    Yang, Yan
    Wang, Jie
    Liu, Xiaorong
    Li, Xiaofan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7657 - 7670
  • [33] TWEET TALK: SENTIMENT ANALYSIS IN TOURISM
    Vladimirova, Alevtina
    5TH INTERNATIONAL ACADEMIC CONFERENCE ON SOCIAL SCIENCE, MULTIDISCIPLINARY AND EUROPEAN STUDIES (MIRDEC), 2017, : 43 - 52
  • [34] Tourism forecasting with granular sentiment analysis
    Li, Hengyun
    Gao, Huicai
    Song, Haiyan
    ANNALS OF TOURISM RESEARCH, 2023, 103
  • [35] Sentiment Analysis of Twitter in Tourism Destinations
    Perez Cabanero, Carmen
    Bigne, Enrique
    Ruiz, Carla
    Carlos Cuenca, Antonio
    3RD INTERNATIONAL CONFERENCE ON ADVANCED RESEARCH METHODS AND ANALYTICS (CARMA 2020), 2020, : 181 - 189
  • [36] Joint training strategy of unimodal and multimodal for multimodal sentiment analysis
    Li, Meng
    Zhu, Zhenfang
    Li, Kefeng
    Zhou, Lihua
    Zhao, Zhen
    Pei, Hongli
    IMAGE AND VISION COMPUTING, 2024, 149
  • [37] Multimodal transformer with adaptive modality weighting for multimodal sentiment analysis
    Wang, Yifeng
    He, Jiahao
    Wang, Di
    Wang, Quan
    Wan, Bo
    Luo, Xuemei
    NEUROCOMPUTING, 2024, 572
  • [38] A Spanish Political Tweets Fine-Tuned Sentiment Analysis Model
    Jimenez-Bravo, Diego M.
    Lozano Murciego, Alvaro
    Bajo, Javier
    De La Iglesia, Daniel H.
    Pinzon, Cristian
    NEW TRENDS IN DISRUPTIVE TECHNOLOGIES, TECH ETHICS AND ARTIFICIAL INTELLIGENCE, DITTET 2022, 2023, 1430 : 91 - 102
  • [39] Heterogeneous graph convolution based on In-domain Self-supervision for Multimodal Sentiment Analysis
    Zeng, Yufei
    Li, Zhixin
    Tang, Zhenjun
    Chen, Zhenbin
    Ma, Huifang
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [40] VAE-Based Adversarial Multimodal Domain Transfer for Video-Level Sentiment Analysis
    Wang, Yanan
    Wu, Jianming
    Furumai, Kazuaki
    Wada, Shinya
    Kurihara, Satoshi
    IEEE ACCESS, 2022, 10 : 51315 - 51324