Tab2Know: Building a Knowledge Base from Tables in Scientific Papers

被引:3
|
作者
Kruit, Benno [1 ,2 ]
He, Hongyu [1 ]
Urbani, Jacopo [1 ]
机构
[1] Vrije Univ Amsterdam, Dept Comp Sci, Amsterdam, Netherlands
[2] Ctr Wiskunde & Informat, Amsterdam, Netherlands
来源
关键词
WEB;
D O I
10.1007/978-3-030-62419-4_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tables in scientific papers contain a wealth of valuable knowledge for the scientific enterprise. To help the many of us who frequently consult this type of knowledge, we present Tab2Know, a new end-to-end system to build a Knowledge Base (KB) from tables in scientific papers. Tab2Know addresses the challenge of automatically interpreting the tables in papers and of disambiguating the entities that they contain. To solve these problems, we propose a pipeline that employs both statistical-based classifiers and logic-based reasoning. First, our pipeline applies weakly supervised classifiers to recognize the type of tables and columns, with the help of a data labeling system and an ontology specifically designed for our purpose. Then, logic-based reasoning is used to link equivalent entities (via sameAs links) in different tables. An empirical evaluation of our approach using a corpus of papers in the Computer Science domain has returned satisfactory performance. This suggests that ours is a promising step to create a large-scale KB of scientific knowledge.
引用
收藏
页码:349 / 365
页数:17
相关论文
共 50 条
  • [1] Knowledge transfer from agri-food scientific papers to a knowledge base
    Trojczak, Rafal
    Trypuz, Robert
    Mazurek, Anna
    Kulicki, Piotr
    [J]. PROCEEDINGS OF THE 2015 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2015, 5 : 1705 - 1712
  • [2] A framework for building scientific knowledge grids applied to thermochemical tables
    von Laszewski, G
    Ruscic, B
    Amin, K
    Wagstrom, P
    Krishnan, S
    Nijsure, S
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2003, 17 (04): : 431 - 447
  • [3] A framework for building scientific knowledge grids applied to thermochemical tables
    Mathematics Division, Argonne National Laboratory, United States
    不详
    不详
    不详
    [J]. 1600, 431-447 (Winter 2003):
  • [4] Crime base: Towards building a knowledge base for crime entities and their relationships from online news papers
    Srinivasa, K.
    Thilagam, P. Santhi
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (06)
  • [5] Data Acquisition and Information Extraction for Scientific Knowledge Base Building
    Andruszkiewicz, Piotr
    Rybinski, Henryk
    [J]. 2018 IEEE 12TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2018, : 256 - 259
  • [6] The scientific knowledge base of special education: Do we know what we think we know?
    Gallagher, DJ
    [J]. EXCEPTIONAL CHILDREN, 1998, 64 (04) : 493 - 502
  • [7] Automatic construction of knowledge base from biological papers
    Ohta, Y
    Yamamoto, Y
    Okazaki, T
    Uchiyama, I
    Takagi, T
    [J]. ISMB-97 - FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS FOR MOLECULAR BIOLOGY, PROCEEDINGS, 1997, : 218 - 225
  • [8] Building Candidate Monolingual Parallel Corpus from Scientific Papers
    Ilyas, Ridwan
    Widiyantoro, Dwi H.
    Khodra, Masayu Leylia
    [J]. 2018 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2018, : 230 - 233
  • [9] Building a large knowledge base from a structured source
    Frank, G
    Farquhar, A
    Fikes, R
    [J]. IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1999, 14 (01): : 47 - 54
  • [10] An approach based on open research knowledge graph for knowledge acquisition from scientific papers
    Jiomekong, Azanzi
    Tiwari, Sanju
    [J]. ELECTRONIC LIBRARY, 2024, 42 (03): : 413 - 442