Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning Skills

Cited by: 0
Authors
Yoran, Ori [1]
Talmor, Alon [1]
Berant, Jonathan [1]
Affiliations
[1] Tel Aviv Univ, Tel Aviv, Israel
Source
PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS) | 2022
Funding
European Research Council
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Models pre-trained with a language modeling objective possess ample world knowledge and language skills, but are known to struggle in tasks that require reasoning. In this work, we propose to leverage semi-structured tables, and automatically generate at scale question-paragraph pairs, where answering the question requires reasoning over multiple facts in the paragraph. We add a pre-training step over this synthetic data, which includes examples that require 16 different reasoning skills such as number comparison, conjunction, and fact composition. To improve data efficiency, we sample examples from reasoning skills where the model currently errs. We evaluate our approach on three reasoning-focused reading comprehension datasets, and show that our model, PReasM, substantially outperforms T5, a popular pre-trained encoder-decoder model. Moreover, sampling examples based on model errors leads to faster training and higher performance.
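The error-based sampling idea in the abstract can be illustrated with a minimal sketch. The abstract does not specify the exact sampling scheme, so everything below is an assumption for illustration: the helper names `error_sampling_weights` and `sample_skills`, the `floor` parameter, and the error-rate values are hypothetical and not taken from the paper. The sketch simply draws reasoning skills with probability proportional to the model's current error rate on each skill.

```python
import random


def error_sampling_weights(error_rates, floor=0.01):
    """Hypothetical helper: turn per-skill error rates into sampling probabilities.

    A small floor keeps every skill sampled occasionally, even when its
    error rate reaches zero (an assumption of this sketch, not the paper).
    """
    floored = {skill: max(rate, floor) for skill, rate in error_rates.items()}
    total = sum(floored.values())
    return {skill: rate / total for skill, rate in floored.items()}


def sample_skills(error_rates, batch_size, rng):
    """Draw the skills for one training batch, favoring skills the model gets wrong."""
    weights = error_sampling_weights(error_rates)
    skills = list(weights)
    return rng.choices(skills, weights=[weights[s] for s in skills], k=batch_size)


# Illustrative error rates only; these are not results reported in the paper.
error_rates = {
    "number comparison": 0.40,
    "conjunction": 0.10,
    "fact composition": 0.30,
}
weights = error_sampling_weights(error_rates)
batch = sample_skills(error_rates, batch_size=8, rng=random.Random(0))
```

In a training loop under these assumptions, the error rates would be re-estimated periodically on held-out examples for each skill, so that the sampling distribution tracks where the model currently errs.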
Pages: 6016-6031
Page count: 16