SherLIiC: A Typed Event-Focused Lexical Inference Benchmark for Evaluating Natural Language Inference

Cited by: 0
Authors
Schmitt, Martin [1]
Schuetze, Hinrich [1]
Affiliations
[1] Ludwig Maximilians Univ Munchen, Ctr Informat & Language Proc CIS, Munich, Germany
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present SherLIiC, a testbed for lexical inference in context (LIiC), consisting of 3985 manually annotated inference rule candidates (InfCands), accompanied by (i) ~960k unlabeled InfCands, and (ii) ~190k typed textual relations between Freebase entities extracted from the large entity-linked corpus ClueWeb09. Each InfCand consists of one of these relations, expressed as a lemmatized dependency path, and two argument placeholders, each linked to one or more Freebase types. Due to our candidate selection process based on strong distributional evidence, SherLIiC is much harder than existing testbeds because distributional evidence is of little utility in the classification of InfCands. We also show that, due to its construction, many of SherLIiC's correct InfCands are novel and missing from existing rule bases. We evaluate a number of strong baselines on SherLIiC, ranging from semantic vector space models to state-of-the-art neural models of natural language inference (NLI). We show that SherLIiC poses a tough challenge to existing NLI systems.
Pages: 902-914
Number of pages: 13