Baselines and Test Data for Cross-Lingual Inference

被引:0
|
作者
Agic, Zeljko [1 ]
Schluter, Natalie [1 ]
机构
[1] IT Univ Copenhagen, Dept Comp Sci, Rued Langgaards Vej 7, DK-2300 Copenhagen S, Denmark
关键词
natural language inference; cross-lingual methods; test data;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The recent years have seen a revival of interest in textual entailment, sparked by i) the emergence of powerful deep neural network learners for natural language processing and ii) the timely development of large-scale evaluation datasets such as SNLI. Recast as natural language inference, the problem now amounts to detecting the relation between pairs of statements: they either contradict or entail one another, or they are mutually neutral. Current research in natural language inference is effectively exclusive to English. In this paper, we propose to advance the research in SNLI-style natural language inference toward multilingual evaluation. To that end, we provide test data for four major languages: Arabic, French, Spanish, and Russian. We experiment with a set of baselines. Our systems are based on cross-lingual word embeddings and machine translation. While our best system scores an average accuracy of just over 75%, we focus largely on enabling further research in multilingual inference.
引用
收藏
页码:3890 / 3894
页数:5
相关论文
共 50 条
  • [41] Universal Cross-Lingual Data Generation for Low Resource ASR
    Wang, Wei
    Qian, Yanmin
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 973 - 983
  • [42] Multilingual and Cross-Lingual Intent Detection from Spoken Data
    Gerz, Daniela
    Su, Pei-Hao
    Kusztos, Razvan
    Mondal, Avishek
    Lis, Michal
    Singhal, Eshan
    Mrksic, Nikola
    Wen, Tsung-Hsien
    Vulic, Ivan
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 7468 - 7475
  • [43] A Cross-Lingual Summarization method based on cross-lingual Fact-relationship Graph Generation
    Zhang, Yongbing
    Gao, Shengxiang
    Huang, Yuxin
    Tan, Kaiwen
    Yu, Zhengtao
    PATTERN RECOGNITION, 2024, 146
  • [44] A Learning to rank framework based on cross-lingual loss function for cross-lingual information retrieval
    Elham Ghanbari
    Azadeh Shakery
    Applied Intelligence, 2022, 52 : 3156 - 3174
  • [45] The Role of Test, Classroom, and Home Language Correspondence in Cross-Lingual Testing
    Alvin Vista
    The Asia-Pacific Education Researcher, 2022, 31 : 711 - 723
  • [46] The Role of Test, Classroom, and Home Language Correspondence in Cross-Lingual Testing
    Vista, Alvin
    ASIA-PACIFIC EDUCATION RESEARCHER, 2022, 31 (06): : 711 - 723
  • [47] Multi-lingual and Cross-lingual timeline extraction
    Laparra, Egoitz
    Agerri, Rodrigo
    Aldabe, Itziar
    Rigau, German
    KNOWLEDGE-BASED SYSTEMS, 2017, 133 : 77 - 89
  • [48] ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion
    Casanova, Edresson
    Shulby, Christopher
    Korolev, Alexander
    Candido Junior, Arnaldo
    Soares, Anderson da Silva
    Aluisio, Sandra
    Ponti, Moacir Antonelli
    INTERSPEECH 2023, 2023, : 1244 - 1248
  • [49] Cross-lingual embedding for cross-lingual question retrieval in low-resource community question answering
    HajiAminShirazi, Shahrzad
    Momtazi, Saeedeh
    MACHINE TRANSLATION, 2020, 34 (04) : 287 - 303
  • [50] Translation-Based Matching Adversarial Network for Cross-Lingual Natural Language Inference
    Qi, Kunxun
    Du, Jianfeng
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 8632 - 8639