Cross lingual transfer learning for sentiment analysis of Italian TripAdvisor reviews

Cited by: 7
Authors
Catelli, Rosario [1]
Bevilacqua, Luca [5]
Mariniello, Nicola [5]
di Carlo, Vladimiro Scotto [5]
Magaldi, Massimo [5]
Fujita, Hamido [2, 3, 4]
De Pietro, Giuseppe [1]
Esposito, Massimo [1]
Affiliations
[1] Natl Res Council CNR, Inst High Performance Comp & Networking ICAR, Naples, Italy
[2] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City, Vietnam
[3] Natl Taipei Univ Technol, Taipei, Taiwan
[4] I Somet Inc Assoc, Morioka, Iwate, Japan
[5] Engn Ingn Informat SpA, Naples, Italy
Keywords
Transfer learning; Sentiment analysis; Italian dataset; BERT; TripAdvisor; Reviews
DOI
10.1016/j.eswa.2022.118246
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Over the years, the attention of the scientific world towards sentiment analysis techniques has increased considerably, driven by industry. The arrival of the Google BERT language model has confirmed the superiority of models based on a particular artificial neural network structure called the Transformer, from which many variants have resulted. These models are generally pre-trained on large text corpora and only later specialized for the precise task at hand on much smaller amounts of data. For these reasons, countless versions were developed to meet the specific needs of each language, especially in the case of languages with relatively few datasets available. At the same time, models pre-trained for multiple languages became widespread, providing greater flexibility of use in exchange for lower performance. This study shows how the use of techniques to transfer learning from high-resource languages to low-resource languages provides an important performance increase: a multilingual BERT model fine-tuned on a mixed English/Italian dataset (the English portion taken from a dataset in the literature, the Italian portion a reviews dataset created ad hoc from the well-known platform TripAdvisor) provides much higher performance than models specific to Italian. Overall, the results obtained by comparing the different possible approaches indicate which one is the most promising to pursue in order to obtain the best results in low-resource scenarios.
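The mixed English/Italian training set described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption for illustration only: the toy sentences, the binary labels, and the helper name `mix_datasets` are hypothetical and are not taken from the paper. The sketch only shows how two labeled corpora might be merged and shuffled before fine-tuning; the fine-tuning step itself (for example with a multilingual BERT checkpoint such as `bert-base-multilingual-cased`, which the abstract does not name) is not shown.

```python
import random
from typing import List, Tuple

# (text, label) pairs; label 1 = positive, 0 = negative (assumed binary scheme)
Labeled = Tuple[str, int]
# (text, language tag, label) triples in the merged set
Mixed = Tuple[str, str, int]


def mix_datasets(english: List[Labeled], italian: List[Labeled], seed: int = 0) -> List[Mixed]:
    """Merge two labeled corpora into one shuffled training set.

    Hypothetical helper: tags each example with its language so the
    proportion of each language can be inspected after mixing.
    """
    mixed = [(text, "en", label) for text, label in english]
    mixed += [(text, "it", label) for text, label in italian]
    # A seeded generator keeps the shuffle reproducible across runs.
    random.Random(seed).shuffle(mixed)
    return mixed


# Toy placeholder data, not from the paper's datasets.
english_reviews = [("Great movie", 1), ("Terrible plot", 0), ("Loved it", 1)]
italian_reviews = [("Ottimo ristorante", 1), ("Servizio pessimo", 0)]

mixed_train = mix_datasets(english_reviews, italian_reviews)
italian_share = sum(1 for _, lang, _ in mixed_train if lang == "it") / len(mixed_train)
print(len(mixed_train), italian_share)
```

In a low-resource setting the high-resource portion would typically be much larger than the target-language portion; tracking the language tag makes that imbalance easy to inspect before training.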
Pages: 9
Related papers
50 in total
  • [1] Sentiment Analysis on TripAdvisor: Are There Inconsistencies in User Reviews?
    Valdivia, Ana
    Victoria Luzon, M.
    Herrera, Francisco
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2017, 2017, 10334 : 15 - 25
  • [2] Semi-supervised Learning on Cross-Lingual Sentiment Analysis with Space Transfer
    He, Xiaonan
    Zhang, Hui
    Chao, Wenhan
    Wang, Daqing
    2015 IEEE FIRST INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2015), 2015, : 371 - 377
  • [3] Sentiment Analysis in TripAdvisor
    Valdivia, Ana
    Luzon, M. Victoria
    Herrera, Francisco
    IEEE INTELLIGENT SYSTEMS, 2017, 32 (04) : 72 - 77
  • [4] Cross-Lingual Sentiment Relation Capturing for Cross-Lingual Sentiment Analysis
    Chen, Qiang
    Li, Wenjie
    Lei, Yu
    Liu, Xule
    Luo, Chuwei
    He, Yanxiang
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017, 2017, 10193 : 54 - 67
  • [5] Sentiment Analysis of Restaurant Customer Reviews on TripAdvisor using Naive Bayes
    Larsono, Rachmawan Adi
    Sungkono, Kelly Rossa
    Sarno, Riyanarto
    Wahyuni, Cahyaningtyas Sekar
    PROCEEDINGS OF 2019 12TH INTERNATIONAL CONFERENCE ON INFORMATION & COMMUNICATION TECHNOLOGY AND SYSTEM (ICTS), 2019, : 49 - 54
  • [6] Cross-lingual Transfer Can Worsen Bias in Sentiment Analysis
    Goldfarb-Tarrant, Seraphina
    Ross, Bjorn
    Lopez, Adam
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 5691 - 5704
  • [7] Learning to Adapt Credible Knowledge in Cross-lingual Sentiment Analysis
    Chen, Qiang
    Li, Wenjie
    Lei, Yu
    Liu, Xule
    He, Yanxiang
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015, : 419 - 429
  • [8] Inconsistencies on TripAdvisor reviews: A unified index between users and Sentiment Analysis Methods
    Valdivia, Ana
    Hrabova, Emiliya
    Chaturvedi, Iti
    Luzon, M. Victoria
    Troiano, Luigi
    Cambria, Erik
    Herrera, Francisco
    NEUROCOMPUTING, 2019, 353 : 3 - 16
  • [9] A cloud-based tool for sentiment analysis in reviews about restaurants on TripAdvisor
    Aguero-Torales, M. M.
    Cobo, M. J.
    Herrera-Viedma, E.
    Lopez-Herrera, A. G.
    7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT (ITQM 2019): INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT BASED ON ARTIFICIAL INTELLIGENCE, 2019, 162 : 392 - 399
  • [10] Cross-lingual sentiment transfer with limited resources
    Rasooli, Mohammad Sadegh
    Farra, Noura
    Radeva, Axinia
    Yu, Tao
    McKeown, Kathleen
    MACHINE TRANSLATION, 2018, 32 (1-2) : 143 - 165