Improving Cross-lingual Text Classification with Zero-shot Instance-Weighting

被引:0
|
作者
Li, Irene [1 ]
Sen, Prithviraj [2 ]
Zhu, Huaiyu [2 ]
Li, Yunyao [2 ]
Radev, Dragomir [1 ]
机构
[1] Yale Univ, New Haven, CT 06520 USA
[2] IBM Res Almaden, San Jose, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-lingual text classification (CLTC) is a challenging task made even harder still due to the lack of labeled data in low-resource languages. In this paper, we propose zero-shot instance-weighting, a general model-agnostic zero-shot learning framework for improving CLTC by leveraging source instance weighting. It adds a module on top of pre-trained language models for similarity computation of instance weights, thus aligning each source instance to the target language. During training, the framework utilizes gradient descent that is weighted by instance weights to update parameters. We evaluate this framework over seven target languages on three fundamental tasks and show its effectiveness and extensibility, by improving on F1 score up to 4% in single-source transfer and 8% in multi-source transfer. To the best of our knowledge, our method is the first to apply instance weighting in zeroshot CLTC. It is simple yet effective and easily extensible into multi-source transfer.
引用
收藏
页码:1 / 7
页数:7
相关论文
共 50 条
  • [21] Improving Zero-Shot Cross-lingual Transfer for Multilingual Question Answering over Knowledge Graph
    Zhou, Yucheng
    Geng, Xiubo
    Shen, Tao
    Zhang, Wenqiang
    Jiang, Daxin
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5822 - 5834
  • [22] Combining Cross-lingual and Cross-task Supervision for Zero-Shot Learning
    Pikuliak, Matus
    Simko, Marian
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2020), 2020, 12284 : 162 - 170
  • [23] Substructure Distribution Projection for Zero-Shot Cross-Lingual Dependency Parsing
    Shi, Freda
    Gimpel, Kevin
    Livescu, Karen
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 6547 - 6563
  • [24] Curriculum meta-learning for zero-shot cross-lingual transfer
    Doan, Toan
    Le, Bac
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 301
  • [25] Analyzing Zero-shot Cross-lingual Transfer in Supervised NLP Tasks
    Choi, Hyunjin
    Kim, Judong
    Joe, Seongho
    Min, Seungjai
    Gwon, Youngjune
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9608 - 9613
  • [26] Synthetic Data Augmentation for Zero-Shot Cross-Lingual Question Answering
    Riabi, Arij
    Scialom, Thomas
    Keraron, Rachel
    Sagot, Benoit
    Seddah, Djame
    Staiano, Jacopo
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 7016 - 7030
  • [27] Rumour Detection via Zero-Shot Cross-Lingual Transfer Learning
    Tian, Lin
    Zhang, Xiuzhen
    Lau, Jey Han
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 603 - 618
  • [28] Cross-Lingual Transfer in Zero-Shot Cross-Language Entity Linking
    Schumacher, Elliot
    Mayfield, James
    Dredze, Mark
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 583 - 595
  • [29] Exposing the limits of Zero-shot Cross-lingual Hate Speech Detection
    Nozza, Debora
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 907 - 914
  • [30] Zero-shot Cross-Lingual Phonetic Recognition with External Language Embedding
    Gao, Heting
    Ni, Junrui
    Zhang, Yang
    Qian, Kaizhi
    Chang, Shiyu
    Hasegawa-Johnson, Mark
    [J]. INTERSPEECH 2021, 2021, : 1304 - 1308