Zero-Shot Learning for Cross-Lingual News Sentiment Classification

被引:23
|
作者
Pelicon, Andraz [1 ,2 ]
Pranjic, Marko [2 ,3 ]
Miljkovic, Dragana [1 ]
Skrlj, Blaz [1 ,2 ]
Pollak, Senja [1 ]
机构
[1] Jozef Stefan Inst, Ljubljana 1000, Slovenia
[2] Jozef Stefan Int Postgrad Sch, Ljubljana 1000, Slovenia
[3] Trikoder Doo, Zagreb 10010, Croatia
来源
APPLIED SCIENCES-BASEL | 2020年 / 10卷 / 17期
基金
欧盟地平线“2020”;
关键词
sentiment analysis; zero-shot learning; news analysis; cross-lingual classification; multilingual transformers;
D O I
10.3390/app10175993
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In this paper, we address the task of zero-shot cross-lingual news sentiment classification. Given the annotated dataset of positive, neutral, and negative news in Slovene, the aim is to develop a news classification system that assigns the sentiment category not only to Slovene news, but to news in another language without any training data required. Our system is based on the multilingual BERTmodel, while we test different approaches for handling long documents and propose a novel technique for sentiment enrichment of the BERT model as an intermediate training step. With the proposed approach, we achieve state-of-the-art performance on the sentiment analysis task on Slovenian news. We evaluate the zero-shot cross-lingual capabilities of our system on a novel news sentiment test set in Croatian. The results show that the cross-lingual approach also largely outperforms the majority classifier, as well as all settings without sentiment enrichment in pre-training.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Zero-Shot Cross-Lingual Transfer with Meta Learning
    Nooralahzadeh, Farhad
    Bekoulis, Giannis
    Bjerva, Johannes
    Augenstein, Isabelle
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 4547 - 4562
  • [2] Cross-lingual Contextualized Topic Models with Zero-shot Learning
    Bianchi, Federico
    Terragni, Silvia
    Hovy, Dirk
    Nozza, Debora
    Fersini, Elisabetta
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1676 - 1683
  • [3] Prompt-based learning framework for zero-shot cross-lingual text classification
    Feng, Kai
    Huang, Lan
    Wang, Kangping
    Wei, Wei
    Zhang, Rui
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [4] Zero-Shot Cross-lingual Semantic Parsing
    Sherborne, Tom
    Lapata, Mirella
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 4134 - 4153
  • [5] Realistic Zero-Shot Cross-Lingual Transfer in Legal Topic Classification
    Xenouleas, Stratos
    Tsoukara, Alexia
    Panagiotakis, Giannis
    Chalkidis, Ilias
    Androutsopoulos, Ion
    [J]. PROCEEDINGS OF THE 12TH HELLENIC CONFERENCE ON ARTIFICIAL INTELLIGENCE, SETN 2022, 2022,
  • [6] Curriculum meta-learning for zero-shot cross-lingual transfer
    Doan, Toan
    Le, Bac
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 301
  • [7] Zero-shot learning based cross-lingual sentiment analysis for sanskrit text with insufficient labeled data
    Kumar, Puneet
    Pathania, Kshitij
    Raman, Balasubramanian
    [J]. APPLIED INTELLIGENCE, 2023, 53 (09) : 10096 - 10113
  • [8] Rumour Detection via Zero-Shot Cross-Lingual Transfer Learning
    Tian, Lin
    Zhang, Xiuzhen
    Lau, Jey Han
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 603 - 618
  • [9] Zero-shot learning based cross-lingual sentiment analysis for sanskrit text with insufficient labeled data
    Puneet Kumar
    Kshitij Pathania
    Balasubramanian Raman
    [J]. Applied Intelligence, 2023, 53 : 10096 - 10113
  • [10] Combining Cross-lingual and Cross-task Supervision for Zero-Shot Learning
    Pikuliak, Matus
    Simko, Marian
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2020), 2020, 12284 : 162 - 170