Cross lingual transfer learning for sentiment analysis of Italian TripAdvisor reviews

被引:7
|
作者
Catelli, Rosario [1 ]
Bevilacqua, Luca [5 ]
Mariniello, Nicola [5 ]
di Carlo, Vladimiro Scotto [5 ]
Magaldi, Massimo [5 ]
Fujita, Hamido [2 ,3 ,4 ]
De Pietro, Giuseppe [1 ]
Esposito, Massimo [1 ]
机构
[1] Natl Res Council CNR, Inst High Performance Comp & Networking ICAR, Naples, Italy
[2] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City, Vietnam
[3] Natl Taipei Univ Technol, Taipei, Taiwan
[4] I Somet Inc Assoc, Morioka, Iwate, Japan
[5] Engn Ingn Informat SpA, Naples, Italy
关键词
Transfer learning; Sentiment analysis; Italian dataset; BERT; TripAdvisor; Reviews;
D O I
10.1016/j.eswa.2022.118246
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the years, the attention of the scientific world towards the techniques of sentiment analysis has increased considerably, driven by industry. The arrival of the Google BERT language model has confirmed the superiority of models based on a particular structure of artificial neural network called Transformer, from which many variants have resulted. These models are generally pre-trained on large text corpora and only later specialized according to the precise task to be faced on much smaller amounts of data. For these reasons, countless versions were developed to meet the specific needs of each language, especially in the case of languages with relatively few datasets available. At the same time, models that were pre-trained for multiple languages became widespread, providing greater flexibility of use in exchange for lower performance. This study shows how the use of techniques to transfer learning from languages with high resources to languages with low resources provides an important performance increase: a multilingual BERT model fine tuned on a mixed English/Italian dataset (using for the English a literature dataset and for the Italian a reviews dataset created ad-hoc from the well-known platform TripAdvisor), provides much higher performance than models specific to Italian. Overall, the results obtained by comparing the different possible approaches indicate which one is the most promising to pursue in order to obtain the best results in low resource scenarios.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Sentiment Analysis of Consumer Reviews Using Deep Learning
    Iqbal, Amjad
    Amin, Rashid
    Iqbal, Javed
    Alroobaea, Roobaea
    Binmahfoudh, Ahmed
    Hussain, Mudassar
    SUSTAINABILITY, 2022, 14 (17)
  • [42] Emoji-Powered Representation Learning for Cross-Lingual Sentiment Classification
    Chen, Zhenpeng
    Shen, Sheng
    Hu, Ziniu
    Lu, Xuan
    Mei, Qiaozhu
    Liu, Xuanzhe
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 251 - 262
  • [43] Emoji-Powered Representation Learning for Cross-Lingual Sentiment Classification
    Chen, Zhenpeng
    Shen, Sheng
    Hu, Ziniu
    Lu, Xuan
    Mei, Qiaozhu
    Liu, Xuanzhe
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4701 - 4705
  • [44] Zero-Shot Learning for Cross-Lingual News Sentiment Classification
    Pelicon, Andraz
    Pranjic, Marko
    Miljkovic, Dragana
    Skrlj, Blaz
    Pollak, Senja
    APPLIED SCIENCES-BASEL, 2020, 10 (17):
  • [45] Sentiment Analysis of Movie Reviews Based on Sentiment Dictionary and Deep Learning Models
    Liu, Caihong
    Liu, Changhui
    2023 THE 6TH INTERNATIONAL CONFERENCE ON ROBOT SYSTEMS AND APPLICATIONS, ICRSA 2023, 2023, : 144 - 148
  • [46] English and Malay Cross-lingual Sentiment Lexicon Acquisition and Analysis
    Nasharuddin, Nurul Amelina
    Abdullah, Muhamad Taufik
    Azman, Azreen
    Kadir, Rabiah Abdul
    INFORMATION SCIENCE AND APPLICATIONS 2017, ICISA 2017, 2017, 424 : 467 - 475
  • [47] A Survey of Cross-lingual Sentiment Analysis: Methodologies, Models and Evaluations
    Xu, Yuemei
    Cao, Han
    Du, Wanze
    Wang, Wenqing
    DATA SCIENCE AND ENGINEERING, 2022, 7 (03) : 279 - 299
  • [48] Distillation Language Adversarial Network for Cross-lingual Sentiment Analysis
    Wang, Deheng
    Yang, Aimin
    Zhou, Yongmei
    Xie, Fenfang
    Ouyang, Zhouhao
    Peng, Sancheng
    2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 45 - 50
  • [49] A Survey of Cross-lingual Sentiment Analysis: Methodologies, Models and Evaluations
    Yuemei Xu
    Han Cao
    Wanze Du
    Wenqing Wang
    Data Science and Engineering, 2022, 7 : 279 - 299
  • [50] Sharing travel experiences on TripAdvisor: A genre analysis of negative hotel reviews written in French, Spanish and Italian
    Cenni, Irene
    JOURNAL OF PRAGMATICS, 2024, 221 : 76 - 88