LogiQA 2.0-An Improved Dataset for Logical Reasoning in Natural Language Understanding

被引:2
|
作者
Liu, Hanmeng [1 ,3 ]
Liu, Jian [2 ,3 ]
Cui, Leyang [1 ,3 ]
Teng, Zhiyang [3 ]
Duan, Nan [4 ]
Zhou, Ming [5 ]
Zhang, Yue [3 ,6 ]
机构
[1] Zhejiang Univ, Hangzhou 310007, Peoples R China
[2] Fudan Univ, Shanghai 200433, Peoples R China
[3] Westlake Univ, Sch Engn, Hangzhou 310024, Peoples R China
[4] Microsoft Res Asia, Beijing 100080, Peoples R China
[5] Langboat Technol, Beijing 100080, Peoples R China
[6] Westlake Inst Adv Study, Inst Adv Technol, Hangzhou 310024, Peoples R China
关键词
Reading comprehension; logical reasoning; natural language inference; textual inference;
D O I
10.1109/TASLP.2023.3293046
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
NLP research on logical reasoning regains momentum with the recent releases of a handful of datasets, notably LogiQA and Reclor. Logical reasoning is exploited in many probing tasks over large Pre-trained Language Models (PLMs) and downstream tasks like question-answering and dialogue systems. In this article, we release LogiQA 2.0. The dataset is an amendment and re-annotation of LogiQA in 2020, a large-scale logical reasoning reading comprehension dataset adapted from the Chinese Civil Service Examination. We increase the data size, refine the texts with manual translation by professionals, and improve the quality by removing items with distinctive cultural features like Chinese idioms. Furthermore, we conduct a fine-grained annotation on the dataset and turn it into a two-way natural language inference (NLI) task, resulting in 35 k premise-hypothesis pairs with gold labels, making it the first large-scale NLI dataset for complex logical reasoning. Compared to Question Answering, Natural Language Inference excels in generalizability and helps downstream tasks better. We establish a baseline for logical reasoning in NLI and incite further research.
引用
收藏
页码:2947 / 2962
页数:16
相关论文
共 16 条
  • [1] LogiQA: A Challenge Dataset for Machine Reading Comprehension with Logical Reasoning
    Liu, Jian
    Cui, Leyang
    Liu, Hanmeng
    Huang, Dandan
    Wang, Yile
    Zhang, Yue
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3622 - 3628
  • [2] BoardgameQA: A Dataset for Natural Language Reasoning with Contradictory Information
    Kazemi, Mehran
    Yuan, Quan
    Bhatia, Deepti
    Kim, Najoung
    Xu, Xin
    Imbrasaite, Vaiva
    Ramachandran, Deepak
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [3] NeuralLog: Natural Language Inference with Joint Neural and Logical Reasoning
    Chen, Zeming
    Gao, Qiyue
    Moss, Lawrence S.
    [J]. 10TH CONFERENCE ON LEXICAL AND COMPUTATIONAL SEMANTICS (SEM 2021), 2021, : 78 - 88
  • [4] Discrete Reasoning Templates for Natural Language Understanding
    Al-Negheimish, Hadeel
    Madhyastha, Pranava
    Russo, Alessandra
    [J]. EACL 2021: THE 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2021, : 80 - 87
  • [5] Logical approach to natural language understanding in a spoken dialogue system
    Villaneau, J
    Antoine, JY
    Ridoux, O
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 637 - 644
  • [6] CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations
    Salewski, Leonard
    Koepke, A. Sophia
    Lensch, Hendrik P. A.
    Akata, Zeynep
    [J]. XXAI - BEYOND EXPLAINABLE AI: International Workshop, Held in Conjunction with ICML 2020, July 18, 2020, Vienna, Austria, Revised and Extended Papers, 2022, 13200 : 69 - 88
  • [7] CommonsenseVIS: Visualizing and Understanding Commonsense Reasoning Capabilities of Natural Language Models
    Wang, Xingbo
    Huang, Renfei
    Jin, Zhihua
    Fang, Tianqing
    Qu, Huamin
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (01) : 273 - 283
  • [8] Building Vietnamese Conversational Smart Home Dataset and Natural Language Understanding Model
    Thi Thu Trang Nguyen
    Trung Duc Anh Dang
    Quoc Viet Vu
    Park, Woomyoung
    [J]. INTERSPEECH 2022, 2022, : 5180 - 5184
  • [9] ShortcutLens: A Visual Analytics Approach for Exploring Shortcuts in Natural Language Understanding Dataset
    Jin, Zhihua
    Wang, Xingbo
    Cheng, Furui
    Sun, Chunhui
    Liu, Qun
    Qu, Huamin
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (07) : 3594 - 3608
  • [10] Towards Detecting Fake News Using Natural Language Understanding and Reasoning in Description Logics
    Groza, Adrian
    [J]. MEASURING ONTOLOGIES FOR VALUE ENHANCEMENT: ALIGNING COMPUTING PRODUCTIVITY WITH HUMAN CREATIVITY FOR SOCIETAL ADAPTATION, MOVE 2020, 2022, 1694 : 57 - 72