WAD-X: Improving Zero-shot Cross-lingual Transfer via Adapter-based Word Alignment

Cited by: 1
Authors
Ahmat, Ahtamjan [1, 2, 3]
Yang, Yating [1, 2, 3]
Ma, Bo [1, 2, 3]
Dong, Rui [1, 2, 3]
Lu, Kaiwen [1, 2, 3]
Wang, Lei [1, 2, 3]
Affiliations
[1] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, 40-1 Beijing Rd, Urumqi 830011, Xinjiang, Peoples R China
[2] Univ Chinese Acad Sci, 19 A Yuquan Rd, Beijing 100049, Peoples R China
[3] Xinjiang Lab Minor Speech & Language Informat Pro, 40-1 Beijing Rd, Urumqi 830011, Xinjiang, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Cross-lingual transfer; adapter; low-resource languages; word alignment;
DOI
10.1145/3610289
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Multilingual pre-trained language models (mPLMs) have achieved remarkable performance on zero-shot cross-lingual transfer learning. However, most mPLMs encourage cross-lingual alignment only implicitly during the pretraining stage, which makes it hard for them to capture accurate word alignment across languages. In this paper, we propose Word-align ADapters for Cross-lingual transfer (WAD-X) to explicitly align the word representations of mPLMs via language-specific subspaces. Taking an mPLM as the backbone model, WAD-X constructs a subspace for each source-target language pair via adapters. The adapters use statistical alignment as prior knowledge to guide word-level alignment in the corresponding bilingual semantic subspace. We evaluate our model across a set of target languages on three zero-shot cross-lingual transfer tasks: part-of-speech tagging (POS), dependency parsing (DP), and sentiment analysis (SA). Experimental results demonstrate that our proposed model improves zero-shot cross-lingual transfer on all three benchmarks, with improvements of 2.19, 2.50, and 1.61 points on the POS, DP, and SA tasks, respectively, over strong baselines.
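
The abstract does not spell out the adapter architecture or the alignment objective, so the following is only a minimal sketch of the general idea under stated assumptions: a standard bottleneck adapter with a residual connection, and a mean-squared-distance loss over word pairs linked by a statistical aligner. The names `adapter`, `alignment_loss`, `w_down`, `w_up`, and the toy vectors are hypothetical and are not taken from the paper, whose actual formulation may differ.

```python
from typing import List, Tuple

Vector = List[float]
Matrix = List[Vector]  # row-major: one row per output dimension


def matvec(w: Matrix, x: Vector) -> Vector:
    """Multiply matrix w (out_dim x in_dim) by vector x."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]


def adapter(x: Vector, w_down: Matrix, w_up: Matrix) -> Vector:
    """Assumed bottleneck adapter: x + W_up * relu(W_down * x)."""
    hidden = [max(0.0, h) for h in matvec(w_down, x)]
    return [xi + ui for xi, ui in zip(x, matvec(w_up, hidden))]


def alignment_loss(src: List[Vector], tgt: List[Vector],
                   pairs: List[Tuple[int, int]]) -> float:
    """Mean squared distance over aligned (source index, target index) pairs."""
    if not pairs:
        return 0.0
    total = 0.0
    for i, j in pairs:
        total += sum((a - b) ** 2 for a, b in zip(src[i], tgt[j]))
    return total / len(pairs)


# Toy example: 2-dimensional word vectors and a 1-dimensional bottleneck.
w_down = [[1.0, 0.0]]
w_up = [[0.5], [0.0]]
src = [adapter(v, w_down, w_up) for v in [[2.0, 1.0], [0.0, 3.0]]]
tgt = [[3.0, 1.0], [0.0, 3.0]]
pairs = [(0, 0), (1, 1)]  # hypothetical links from a statistical aligner
loss = alignment_loss(src, tgt, pairs)
```

In this sketch the alignment links act as fixed supervision: only the adapter weights would be trained to reduce the loss, while the backbone mPLM representations (the input vectors here) stay unchanged.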
Pages: 23