Zero-Shot Cross-Lingual Machine Reading Comprehension via Inter-sentence Dependency Graph

被引:0
|
作者
Xu, Liyan [1 ]
Zhang, Xuchao [2 ]
Zong, Bo [2 ]
Liu, Yanchi [2 ]
Cheng, Wei [2 ]
Ni, Jingchao [2 ]
Chen, Haifeng [2 ]
Zhao, Liang [1 ]
Choi, Jinho D. [1 ]
机构
[1] Emory Univ, Dept Comp Sci, Atlanta, GA 30322 USA
[2] NEC Labs Amer, Princeton, NJ USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We target the task of cross-lingual Machine Reading Comprehension (MRC) in the direct zero-shot setting, by incorporating syntactic features from Universal Dependencies (UD), and the key features we use are the syntactic relations within each sentence. While previous work has demonstrated effective syntax-guided MRC models, we propose to adopt the inter-sentence syntactic relations, in addition to the rudimentary intra-sentence relations, to further utilize the syntactic dependencies in the multi-sentence input of the MRC task. In our approach, we build the Inter-Sentence Dependency Graph (ISDG) connecting dependency trees to form global syntactic relations across sentences. We then propose the ISDG encoder that encodes the global dependency graph, addressing the inter-sentence relations via both one-hop and multi-hop dependency paths explicitly. Experiments on three multilingual MRC datasets (XQuAD, MLQA, TyDiQA-GoldP) show that our encoder that is only trained on English is able to improve the zero-shot performance on all 14 test sets covering 8 languages, with up to 3.8 F1/5.2 EM improvement on-average, and 5.2 F1/11.2 EM on certain languages. Further analysis shows the improvement can be attributed to the attention on the cross-linguistically consistent syntactic path. Our code is available at https://github.com/lxucs/multilingual-mrc-isdg.
引用
下载
收藏
页码:11538 / 11546
页数:9
相关论文
共 50 条
  • [21] Evaluating morphological typology in zero-shot cross-lingual transfer
    Martinez-Garcia, Antonio
    Badia, Toni
    Barnes, Jeremy
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 3136 - 3153
  • [22] Reinforced Zero-Shot Cross-Lingual Neural Headline Generation
    Ayana
    Chen, Yun
    Yang, Cheng
    Liu, Zhiyuan
    Sun, Maosong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 (28) : 2572 - 2584
  • [23] Towards zero-shot cross-lingual named entity disambiguation
    Barrena, Ander
    Soroa, Aitor
    Agirre, Eneko
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 184
  • [24] (ALMOST) ZERO-SHOT CROSS-LINGUAL SPOKEN LANGUAGE UNDERSTANDING
    Upadhyay, Shyam
    Faruqui, Manaal
    Tur, Gokhan
    Hakkani-Tur, Dilek
    Heck, Larry
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6034 - 6038
  • [25] Cross-lingual Contextualized Topic Models with Zero-shot Learning
    Bianchi, Federico
    Terragni, Silvia
    Hovy, Dirk
    Nozza, Debora
    Fersini, Elisabetta
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1676 - 1683
  • [26] Towards Making the Most of Cross-Lingual Transfer for Zero-Shot Neural Machine Translation
    Chen, Guanhua
    Ma, Shuming
    Chen, Yun
    Zhang, Dongdong
    Pan, Jia
    Wang, Wenping
    Wei, Furu
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 142 - 157
  • [27] Simple and Effective Zero-shot Cross-lingual Phoneme Recognition
    Xu, Qiantong
    Baevski, Alexei
    Auli, Michael
    INTERSPEECH 2022, 2022, : 2113 - 2117
  • [28] Zero-Shot Cross-Lingual Transfer of Neural Machine Translation with Multilingual Pretrained Encoders
    Chen, Guanhua
    Ma, Shuming
    Chen, Yun
    Dong, Li
    Zhang, Dongdong
    Pan, Jia
    Wang, Wenping
    Wei, Furu
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 15 - 26
  • [29] Zero-Shot Learning for Cross-Lingual News Sentiment Classification
    Pelicon, Andraz
    Pranjic, Marko
    Miljkovic, Dragana
    Skrlj, Blaz
    Pollak, Senja
    APPLIED SCIENCES-BASEL, 2020, 10 (17):
  • [30] XCMRC: Evaluating Cross-Lingual Machine Reading Comprehension
    Liu, Pengyuan
    Deng, Yuning
    Zhu, Chenghao
    Hu, Han
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING (NLPCC 2019), PT I, 2019, 11838 : 552 - 564