Populating Web-Scale Knowledge Graphs Using Distantly Supervised Relation Extraction and Validation

被引:2
|
作者
Dash, Sarthak [1 ]
Glass, Michael R. [1 ]
Gliozzo, Alfio [1 ]
Canim, Mustafa [1 ]
Rossiello, Gaetano [1 ]
机构
[1] IBM Thomas J Watson Res Ctr, IBM Res AI, Yorktown Hts, NY 10598 USA
关键词
information extraction; knowledge graphs; deep learning;
D O I
10.3390/info12080316
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a fully automated system to extend knowledge graphs using external information from web-scale corpora. The designed system leverages a deep-learning-based technology for relation extraction that can be trained by a distantly supervised approach. In addition, the system uses a deep learning approach for knowledge base completion by utilizing the global structure information of the induced KG to further refine the confidence of the newly discovered relations. The designed system does not require any effort for adaptation to new languages and domains as it does not use any hand-labeled data, NLP analytics, and inference rules. Our experiments, performed on a popular academic benchmark, demonstrate that the suggested system boosts the performance of relation extraction by a wide margin, reporting error reductions of 50%, resulting in relative improvement of up to 100%. Furthermore, a web-scale experiment conducted to extend DBPedia with knowledge from Common Crawl shows that our system is not only scalable but also does not require any adaptation cost, while yielding a substantial accuracy gain.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Incorporating Instance Correlations in Distantly Supervised Relation Extraction
    Zhang, Luhao
    Hu, Linmei
    Shi, Chuan
    [J]. SEMANTIC TECHNOLOGY, JIST 2019: PROCEEDINGS, 2020, 12032 : 177 - 191
  • [32] Dynamic Chunkwise CNN for Distantly Supervised Relation Extraction
    Liu, Fangbing
    Wang, Qing
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 738 - 747
  • [33] Distantly Supervised Neural Network Model for Relation Extraction
    Wang, Zhen
    Chang, Baobao
    Sui, Zhifang
    [J]. CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA (CCL 2015), 2015, 9427 : 253 - 266
  • [34] Distantly-Supervised Neural Relation Extraction with Side Information using BERT
    Moreira, Johny
    Oliveira, Chaina
    Macedo, David
    Zanchettin, Cleber
    Barbosa, Luciano
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [35] A Multi-teacher Knowledge Distillation Framework for Distantly Supervised Relation Extraction with Flexible Temperature
    Fei, Hongxiao
    Tan, Yangying
    Huang, Wenti
    Long, Jun
    Huang, Jincai
    Yang, Liu
    [J]. WEB AND BIG DATA, PT II, APWEB-WAIM 2023, 2024, 14332 : 103 - 116
  • [36] Improving Distantly Supervised Relation Extraction by Knowledge Base-Driven Zero Subject Resolution
    Kim, Eun-Kyung
    Choi, Key-Sun
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (10) : 2551 - 2558
  • [37] A Web-scale system for scientific knowledge exploration
    Shen, Zhihong
    Ma, Hao
    Wang, Kuansan
    [J]. 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2018): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, 2018, : 87 - 92
  • [38] MKDAT: Multi-Level Knowledge Distillation with Adaptive Temperature for Distantly Supervised Relation Extraction
    Long, Jun
    Yin, Zhuoying
    Han, Yan
    Huang, Wenti
    [J]. INFORMATION, 2024, 15 (07)
  • [39] A Dataset for Web-Scale Knowledge Base Population
    Glass, Michael
    Gliozzo, Alfio
    [J]. SEMANTIC WEB (ESWC 2018), 2018, 10843 : 256 - 271
  • [40] WSKE: A Web-Scale Spatial Knowledge Extractor
    Lee, S.
    Kim, I.
    [J]. ADVANCED SCIENCE LETTERS, 2017, 23 (12) : 12757 - 12761