Refined Commonsense Knowledge From Large-Scale Web Contents

被引:4
|
作者
Nguyen, Tuan-Phong [1 ]
Razniewski, Simon [1 ]
Romero, Julien [2 ]
Weikum, Gerhard [1 ]
机构
[1] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
[2] Telecom SudParis, F-91000 Evry, France
关键词
Semantics; Commonsense reasoning; Knowledge representation; Dinosaurs; Cleaning; Taxonomy; Task analysis; Commonsense knowledge; knowledge base construction; SENSE; CYC;
D O I
10.1109/TKDE.2022.3206505
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Commonsense knowledge (CSK) about concepts and their properties is helpful for AI applications. Prior works, such as ConceptNet, have compiled large CSK collections. However, they are restricted in their expressiveness to subject-predicate-object (SPO) triples with simple concepts for S and strings for P and O. This paper presents a method called Ascent++ to automatically build a large-scale knowledge base (KB) of CSK assertions, with refined expressiveness and both better precision and recall than prior works. Ascent++ goes beyond SPO triples by capturing composite concepts with subgroups and aspects, and by refining assertions with semantic facets. The latter is essential to express the temporal and spatial validity of assertions and further qualifiers. Furthermore, Ascent++ combines open information extraction (OpenIE) with judicious cleaning and ranking by typicality and saliency scores. For high coverage, our method taps into the large-scale crawl C4 with broad web contents. The evaluation with human judgments shows the superior quality of the Ascent++ KB, and an extrinsic evaluation for QA-support tasks underlines the benefits of Ascent++. A web interface, data, and code can be accessed at https://ascentpp.mpi-inf.mpg.de/.
引用
收藏
页码:8431 / 8447
页数:17
相关论文
共 50 条
  • [1] Neural Word Representations from Large-Scale Commonsense Knowledge
    Chen, Jiaqiang
    Tandon, Niket
    de Melo, Gerard
    [J]. 2015 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT), VOL 1, 2015, : 225 - 228
  • [2] Large-Scale Commonsense Knowledge for Default Logic Reasoning
    Järv P.
    Tammet T.
    Verrev M.
    Draheim D.
    [J]. SN Computer Science, 4 (5)
  • [3] Large Language Models as Commonsense Knowledge for Large-Scale Task Planning
    Zhao, Zirui
    Lee, Wee Sun
    Hsu, David
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [4] Extracting large-scale knowledge bases from the web
    Kumar, R
    Raghavan, P
    Rajagopalan, S
    Tomkins, A
    [J]. PROCEEDINGS OF THE TWENTY-FIFTH INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, 1999, : 639 - 650
  • [5] WebBrain: Joint Neural Learning of Large-Scale Commonsense Knowledge
    Chen, Jiaqiang
    Tandon, Niket
    Hariman, Charles Darwis
    de Melo, Gerard
    [J]. SEMANTIC WEB - ISWC 2016, PT I, 2016, 9981 : 102 - 118
  • [6] Mining Large-scale Event Knowledge from Web Text
    Cao, Ya-nan
    Zhang, Peng
    Guo, Jing
    Guo, Li
    [J]. 2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2014, 29 : 478 - 487
  • [7] Commonsense Knowledge Mining from the Web
    Yu, Chi-Hsin
    Chen, Hsin-Hsi
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 1480 - 1485
  • [8] KnoWeb - A knowledge web for large-scale, evolving distributed knowledge resources
    Daniel, RS
    Kiss, PA
    Yalowitz, JS
    [J]. 1998 IEEE INFORMATION TECHNOLOGY CONFERENCE, PROCEEDINGS, 1998, : 75 - 78
  • [9] Building a large-scale commonsense knowledge base by converting an existing one in a different language
    Jung, Yuchul
    Lee, Joo-Young
    Kim, Youngho
    Park, Jaehyun
    Myaeng, Sung-Hyon
    Rim, Hae-Chang
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2007, 4394 : 23 - +
  • [10] Acquiring Comparative Commonsense Knowledge from the Web
    Tandon, Niket
    de Melo, Gerard
    Weikum, Gerhard
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 166 - 172