XINFOTABS: Evaluating Multilingual Tabular Natural Language Inference

被引:0
|
作者
Minhas, Bhavnick [1 ]
Shankhdhar, Anant [1 ]
Gupta, Vivek [2 ]
Aggrawal, Divyanshu [3 ]
Zhang, Shuo [4 ]
机构
[1] Indian Inst Technol, Gauhati, India
[2] Univ Utah, Sch Comp, Salt Lake City, UT 84112 USA
[3] Delhi Technol Univ, Delhi, India
[4] Bloomberg, New York, NY USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The ability to reason about tabular or semi-structured knowledge is a fundamental problem for today's Natural Language Processing (NLP) systems. While significant progress has been achieved in the direction of tabular reasoning, these advances are limited to English due to the absence of multilingual benchmark datasets for semi-structured data. In this paper, we use machine translation methods to construct a multilingual tabular natural language inference (TNLI) dataset, namely XINFOTABS, which expands the English TNLI dataset of INFOTABS to ten diverse languages. We also present several baselines for multilingual tabular reasoning, e.g., machine translation-based methods and cross-lingual TNLI. We discover that the XINFOTABS evaluation suite is both practical and challenging. As a result, this dataset will contribute to increased linguistic inclusion in tabular reasoning research and applications.
引用
收藏
页码:59 / 77
页数:19
相关论文
共 50 条
  • [1] A semantics-aware approach for multilingual natural language inference
    Le-Hong, Phuong
    Cambria, Erik
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2023, 57 (02) : 611 - 639
  • [2] A semantics-aware approach for multilingual natural language inference
    Phuong Le-Hong
    Erik Cambria
    [J]. Language Resources and Evaluation, 2023, 57 : 611 - 639
  • [3] Natural Language Inference for Portuguese Using BERT and Multilingual Information
    Sobrevilla Cabezudo, Marco Antonio
    Inacio, Marcio
    Rodrigues, Ana Carolina
    Casanova, Edresson
    de Sousa, Rogerio Figueredo
    [J]. COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2020, 2020, 12037 : 346 - 356
  • [4] A Study of the State of the Art Approaches and Datasets for Multilingual Natural Language Inference
    Renjit, Sara
    Idicula, Sumam Mary
    [J]. Neural Processing Letters, 2024, 56 (06)
  • [5] Evaluating Deep Learning Techniques for Natural Language Inference
    Eleftheriadis, Petros
    Perikos, Isidoros
    Hatzilygeroudis, Ioannis
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (04):
  • [6] Evaluating BERT for natural language inference: A case study on the CommitmentBank
    Jiang, Nanjiang
    de Marneffe, Marie-Catherine
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 6086 - 6091
  • [7] Evaluating Natural Language Inference Models: A Metamorphic Testing Approach
    Jiang, Mingyue
    Bao, Houzhen
    Tu, Kaiyi
    Zhang, Xiao-Yi
    Ding, Zuohua
    [J]. 2021 IEEE 32ND INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE 2021), 2021, : 220 - 230
  • [8] Investigating Transfer Learning in Multilingual Pre-trained Language Models through Chinese Natural Language Inference
    Hu, Hai
    Zhou, He
    Tian, Zuoyu
    Zhang, Yiwen
    Ma, Yina
    Li, Yanting
    Nie, Yixin
    Richardson, Kyle
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 3770 - 3785
  • [9] Language model for multilingual natural language generation
    Zhang, Dongmo
    Ge, Yong
    Yao, Tianfang
    [J]. Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2000, 34 (07): : 944 - 947
  • [10] SherLIiC: A Typed Event-Focused Lexical Inference Benchmark for Evaluating Natural Language Inference
    Schmitt, Martin
    Schuetze, Hinrich
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 902 - 914