Explaining Simple Natural Language Inference

Cited by: 0
Authors
Kalouli, Aikaterini-Lida [1]
Buis, Annebeth [2]
Real, Livy [3]
Palmer, Martha [2]
de Paiva, Valeria [4]
Affiliations
[1] Univ Konstanz, Constance, Germany
[2] Univ Colorado, Boulder, CO 80309 USA
[3] Univ Sao Paulo, Sao Paulo, Brazil
[4] Univ Birmingham, Birmingham, W Midlands, England
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The vast amount of research introducing new corpora and techniques for (semi-)automatically annotating corpora shows the important role that datasets play in today's research, especially in the machine learning community. This rapid development raises concerns about the quality of the datasets created and consequently of the models trained, as recently discussed with respect to the Natural Language Inference (NLI) task. In this work we conduct an annotation experiment based on a small subset of the SICK corpus. The experiment reveals several problems in the annotation guidelines, and various challenges of the NLI task itself. Our quantitative evaluation of the experiment allows us to assign our empirical observations to specific linguistic phenomena and leads us to recommendations for future annotation tasks, for NLI and possibly for other tasks.
Pages: 132 - 143
Number of pages: 12
Related papers
50 in total
  • [21] Natural language directed inference from ontologies
    Mellish, Chris
    Pan, Jeff Z.
    ARTIFICIAL INTELLIGENCE, 2008, 172 (10) : 1285 - 1315
  • [22] An Exploration of Dropout with RNNs for Natural Language Inference
    Gajbhiye, Amit
    Jaf, Sardar
    Al Moubayed, Noura
    McGough, A. Stephen
    Bradley, Steven
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 : 157 - 167
  • [23] Natural language as the basis for meaning representation and inference
    Dagan, Ido
    Bar-Haim, Roy
    Szpektor, Idan
    Greental, Iddo
    Shnarch, Eyal
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2008, 4919 : 151 - +
  • [24] Investigating Reasons for Disagreement in Natural Language Inference
    Jiang, Nan-Jiang
    de Marneffe, Marie-Catherine
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 1357 - 1374
  • [25] Data and Representation for Turkish Natural Language Inference
    Budur, Emrah
    Ozcelik, Riza
    Gungor, Tunga
    Potts, Christopher
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 8253 - 8267
  • [26] Syntactic Knowledge for Natural Language Inference in Portuguese
    Fonseca, Erick
    Aluisio, Sandra M.
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2018, 2018, 11122 : 242 - 252
  • [27] FarsTail: a Persian natural language inference dataset
    Amirkhani, Hossein
    AzariJafari, Mohammad
    Faridan-Jahromi, Soroush
    Kouhkan, Zeinab
    Pourjafari, Zohreh
    Amirak, Azadeh
    SOFT COMPUTING, 2023,
  • [28] Natural Language Inference Based on Adversarial Regularization
    Liu G.-C.
    Cao Y.
    Xu J.-M.
    Xu B.
    Zidonghua Xuebao/Acta Automatica Sinica, 2019, 45 (08) : 1455 - 1463
  • [29] Convolutional Interaction Network for Natural Language Inference
    Gong, Jingjing
    Qiu, Xipeng
    Chen, Xinchi
    Liang, Dong
    Huang, Xuanjing
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 1576 - 1585
  • [30] Explaining Commonalities of Clusters of RDF Resources in Natural Language
    Colucci, Simona
    Donini, Francesco M.
    Di Sciascio, Eugenio
    FOUNDATIONS OF INTELLIGENT SYSTEMS, ISMIS 2024, 2024, 14670 : 160 - 169