Multimodal model for the Spanish sentiment analysis in a tourism domain

Cited by: 0
Authors
Monsalve-Pulido, Julian [1]
Parra, Carlos Alberto [2]
Aguilar, Jose [3, 4, 5]
Affiliations
[1] Univ Pedag & Tecnol Colombia, GIMI, Tunja, Colombia
[2] Pontificia Univ Javeriana, Bogota, Colombia
[3] Univ Los Andes, CEMISID, Merida, Venezuela
[4] Univ EAFIT, CIDITIC, Medellin, Colombia
[5] IMDEA Networks Inst, Madrid, Spain
Keywords
Multimodal model; Sentiment analysis; Opinion mining; Spanish language; Tourism
DOI
10.1007/s13278-024-01202-3
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Sentiment analysis of tourism data centers on the multimodal characteristics of the content that tourists generate on digital platforms and social networks. Their opinions typically combine text, images or numbers (ratings), which poses a significant challenge for sentiment analysis and calls for new models or multimodal data classification techniques. This work proposes a multimodal sentiment analysis model for Spanish-language data in the tourism domain, composed of four main phases (extraction, classification, fusion, visualization) and a transversal phase that evaluates the quality of the multimodal sentiment analysis process. The model thus integrates a data quality model to improve multimodal sentiment analysis tasks; in addition, the linguistic resource "SenticNet 5" is adapted to Spanish. The model was validated with several classification metrics, and its classification results were compared against a manually labeled dataset (TASS) using two machine learning classification algorithms. The first was Random Forest, for which the manually labeled dataset reached an F1 score of 50%, while the dataset generated automatically with the adapted SenticNet reached an F1 score of 71% and an accuracy of 70%. The F1 score on the SenticNet-generated dataset is therefore 21 percentage points higher than on the TASS dataset. The second algorithm was Support Vector Machine (SVM), which classified the SenticNet-generated dataset with an F1 score of 72%, versus 57.7% for the manually labeled dataset (a difference of 14.3 percentage points). In the fusion tests on the multimodal sentiment inputs, accuracy was 65% for text, 33% for images, and 71% for the fusion of both. Overall, for user opinions that combine Spanish text and images, polarity identification improves when each modality is classified independently and a polarity fusion process is then applied.
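The "classify each modality independently, then fuse" idea described in the abstract can be illustrated with a minimal decision-level (late) fusion sketch. The abstract does not specify the fusion rule, so the weighted average used here, the function name `fuse_polarity`, the three-label set, the weight value, and the example probabilities are all assumptions made for illustration, not the authors' method or data.

```python
from typing import Dict, Tuple

# Hypothetical polarity label set; the paper's actual labels are not given in the abstract.
LABELS = ("negative", "neutral", "positive")


def fuse_polarity(
    text_probs: Dict[str, float],
    image_probs: Dict[str, float],
    text_weight: float = 0.7,
) -> Tuple[str, Dict[str, float]]:
    """Fuse per-class probabilities from two independent classifiers.

    Assumption: a simple weighted average stands in for the fusion step;
    text_weight is the share given to the text classifier.
    """
    if not 0.0 <= text_weight <= 1.0:
        raise ValueError("text_weight must be between 0 and 1")
    image_weight = 1.0 - text_weight
    fused = {
        label: text_weight * text_probs[label] + image_weight * image_probs[label]
        for label in LABELS
    }
    # The fused polarity is the class with the highest combined score.
    best = max(LABELS, key=lambda label: fused[label])
    return best, fused


# Made-up outputs of a text classifier and an image classifier for one review.
text_probs = {"negative": 0.2, "neutral": 0.5, "positive": 0.3}
image_probs = {"negative": 0.1, "neutral": 0.1, "positive": 0.8}

label, fused = fuse_polarity(text_probs, image_probs, text_weight=0.5)
print(label, fused)
```

In this made-up case the text classifier alone would choose "neutral", while the fused scores favor "positive". In practice the weight would be tuned on validation data rather than fixed by hand.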
Pages: 18