Schema induction from incomplete semantic data

被引:5
|
作者
Gao, Huan [1 ,2 ]
Qi, Guilin [1 ,2 ]
Ji, Qiu [3 ]
机构
[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing, Jiangsu, Peoples R China
[2] Southeast Univ, Minist Educ, Key Lab Comp Network & Informat Integrat, Nanjing, Jiangsu, Peoples R China
[3] Univ Posts & Telecommun, Sch Modern Posts & Inst Modern Posts, Nanjing, Jiangsu, Peoples R China
关键词
Ontology learning; knowledge bases; open world assumption; association rule mining; semantic web;
D O I
10.3233/IDA-173514
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the development of the Semantic Web, more and more semantic data including many useful knowledge bases has been published on the Web. Such knowledge bases always lack expressive schema information, especially disjointness axioms and subclass axioms. This makes it difficult to perform many critical Semantic Web tasks like ontology reasoning, inconsistency handling and ontology mapping. To deal with this problem, a few approaches have been proposed to generate terminology axioms. However, they often adopt the closed world assumption which is opposite to the assumption adopted by the semantic data. This may lead to a lot of noisy negative examples so that existing learning approaches fail to perform well on such incomplete data. In this paper, a novel framework is proposed to automatically obtain disjointness axioms and subclass axioms from incomplete semantic data. This framework first obtains probabilistic type assertions by exploiting a type inference algorithm. Then a mining approach based on association rule mining is proposed to learn high-quality schema information. To address the incompleteness problem of semantic data, the mining model introduces novel definitions to compute the support and confidence for pruning false axioms. Our experimental evaluation shows promising results over several real-life incomplete knowledge bases like DBpedia and LUBM by comparing with existing relevant approaches.
引用
收藏
页码:1337 / 1353
页数:17
相关论文
共 50 条
  • [31] Properties of derivations in a Semantic Schema
    Tandareanu, Nicolae
    Ghindeanu, Mihaela
    [J]. ANNALS OF THE UNIVERSITY OF CRAIOVA-MATHEMATICS AND COMPUTER SCIENCE SERIES, 2006, 33 : 147 - 153
  • [32] A survey on semantic schema discovery
    Kenza Kellou-Menouer
    Nikolaos Kardoulakis
    Georgia Troullinou
    Zoubida Kedad
    Dimitris Plexousakis
    Haridimos Kondylakis
    [J]. The VLDB Journal, 2022, 31 : 675 - 710
  • [33] A formalisation of semantic schema integration
    McBrien, P
    Poulovassilis, A
    [J]. INFORMATION SYSTEMS, 1998, 23 (05) : 307 - 334
  • [34] Semantic integration of XML schema
    Zhang, YF
    Liu, WY
    [J]. 2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 1058 - 1061
  • [35] A survey on semantic schema discovery
    Kellou-Menouer, Kenza
    Kardoulakis, Nikolaos
    Troullinou, Georgia
    Kedad, Zoubida
    Plexousakis, Dimitris
    Kondylakis, Haridimos
    [J]. VLDB JOURNAL, 2022, 31 (04): : 675 - 710
  • [36] AN APPROACH TO GENERATION OF SEMANTIC NETWORK FROM RELATIONAL DATABASE SCHEMA
    WU, XD
    ZHANG, DC
    [J]. CHINESE SCIENCE BULLETIN, 1991, 36 (14): : 1222 - 1225
  • [37] AN APPROACH TO GENERATION OF SEMANTIC NETWORK FROM RELATIONAL DATABASE SCHEMA
    吴信东
    张奠成
    [J]. Science Bulletin, 1991, (14) : 1222 - 1225
  • [38] Statistical Schema Induction
    Voelker, Johanna
    Niepert, Mathias
    [J]. SEMANTIC WEB: RESEARCH AND APPLICATIONS, PT I, 2011, 6643 : 124 - 138
  • [39] Using Semantic Similarity for Schema Matching of Semi-structured and Linked Data
    Kettouch, Mohamed Salah
    Luca, Cristina
    Hobbs, Mike
    Dascalu, Sergiu
    [J]. PROCEEDINGS OF THE 2017 7TH INTERNATIONAL CONFERENCE INTERNET TECHNOLOGIES AND APPLICATIONS (ITA), 2017, : 128 - 133
  • [40] Semantic-Similarity-Based Schema Matching for Management of Building Energy Data
    Pan, Zhiyu
    Pan, Guanchen
    Monti, Antonello
    [J]. ENERGIES, 2022, 15 (23)