Baselines and Test Data for Cross-Lingual Inference

Authors
Agic, Zeljko [1]
Schluter, Natalie [1]
Affiliations
[1] IT Univ Copenhagen, Dept Comp Sci, Rued Langgaards Vej 7, DK-2300 Copenhagen S, Denmark
Keywords
natural language inference; cross-lingual methods; test data
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Recent years have seen a revival of interest in textual entailment, sparked by (i) the emergence of powerful deep neural network learners for natural language processing and (ii) the timely development of large-scale evaluation datasets such as SNLI. Recast as natural language inference, the problem now amounts to detecting the relation between a pair of statements: they either contradict or entail one another, or they are mutually neutral. Current research in natural language inference is effectively limited to English. In this paper, we propose to advance research in SNLI-style natural language inference toward multilingual evaluation. To that end, we provide test data for four major languages: Arabic, French, Spanish, and Russian. We experiment with a set of baselines based on cross-lingual word embeddings and machine translation. While our best system scores an average accuracy of just over 75%, we focus largely on enabling further research in multilingual inference.
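The evaluation described in the abstract can be illustrated with a minimal sketch: three-way label accuracy per language, then an unweighted mean across languages. This is not the authors' code. The function names, the toy data, and the choice of an unweighted mean over languages are all assumptions made for illustration; the record does not say how the average was computed.

```python
from statistics import mean

# SNLI-style three-way label set, as named in the abstract.
LABELS = ("entailment", "contradiction", "neutral")


def accuracy(gold, predicted):
    """Fraction of statement pairs whose predicted label matches the gold label."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    if not gold:
        raise ValueError("cannot score an empty test set")
    for label in (*gold, *predicted):
        if label not in LABELS:
            raise ValueError(f"unknown label: {label!r}")
    return sum(g == p for g, p in zip(gold, predicted)) / len(gold)


def average_accuracy(per_language):
    """Unweighted mean of per-language accuracies (an assumed averaging scheme)."""
    return mean(accuracy(gold, pred) for gold, pred in per_language.values())


# Hypothetical toy data, NOT results from the paper: language code -> (gold, predicted).
toy_results = {
    "ar": (["entailment", "neutral"], ["entailment", "neutral"]),
    "fr": (["contradiction", "neutral"], ["contradiction", "entailment"]),
}

if __name__ == "__main__":
    for lang, (gold, pred) in toy_results.items():
        print(lang, accuracy(gold, pred))
    print("average", average_accuracy(toy_results))
```

An unweighted mean gives each language equal weight regardless of test-set size; a pooled accuracy over all pairs would be an equally plausible reading of "average accuracy".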
Pages: 3890-3894
Number of pages: 5