Matching large ontologies: A divide-and-conquer approach

被引:122
|
作者
Hu, Wei [1 ]
Qu, Yuzhong [1 ]
Cheng, Gong [1 ]
机构
[1] SE Univ, Sch Engn & Comp Sci, Nanjing 210096, Peoples R China
关键词
ontology matching; data integration; semantic heterogeneity;
D O I
10.1016/j.datak.2008.06.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ontologies proliferate with the progress of the Semantic Web. Ontology matching is an important way of establishing interoperability between (Semantic) Web applications that use different but related ontologies. Due to their sizes and monolithic nature, large ontologies regarding real world domains bring a new challenge to the state of the art ontology matching technology. In this paper, we propose a divide-and-conquer approach to matching large ontologies. We develop a structure-based partitioning algorithm, which partitions entities of each ontology into a set of small clusters and constructs blocks by assigning RDF Sentences to those clusters. Then, the blocks from different ontologies are matched based on precalculated anchors, and the block mappings holding high similarities are selected. Finally, two powerful matchers, V-Doc and GMO, are employed to discover alignments in the block mappings. Comprehensive evaluation on both synthetic and real world data sets demonstrates that our approach both solves the scalability problem and achieves good precision and recall with significant reduction of execution time. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:140 / 160
页数:21
相关论文
共 50 条
  • [21] Divide-and-conquer approach for the exemplar breakpoint distance
    Nguyen, CT
    Tay, YC
    Zhang, LX
    BIOINFORMATICS, 2005, 21 (10) : 2171 - 2176
  • [22] A divide-and-conquer approach to acquire syntactic categories
    Gamallo, P
    Lopes, GP
    Da Silva, JF
    GRAMMATICAL INFERENCE: ALGORITHMS AND APPLICATIONS, PROCEEDINGS, 2004, 3264 : 151 - 162
  • [23] HEADINGS, OR DIVIDE-AND-CONQUER
    DOLLE, R
    JOURNAL OF ENVIRONMENTAL HEALTH, 1990, 53 (03) : 56 - 56
  • [24] MULTIDIMENSIONAL DIVIDE-AND-CONQUER
    BENTLEY, JL
    COMMUNICATIONS OF THE ACM, 1980, 23 (04) : 214 - 229
  • [25] A divide-and-conquer approach for content replication in WMNs
    Al-Arnaout, Zakwan
    Fu, Qiarig
    Frean, Marcus
    COMPUTER NETWORKS, 2013, 57 (18) : 3914 - 3928
  • [26] Gaussian Process Learning: A Divide-and-Conquer Approach
    Li, Wenye
    ADVANCES IN NEURAL NETWORKS - ISNN 2014, 2014, 8866 : 262 - 269
  • [27] A Divide-and-Conquer Approach to the Summarization of Long Documents
    Gidiotis, Alexios
    Tsoumakas, Grigorios
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 (28) : 3029 - 3040
  • [28] A divide-and-conquer approach to compressed sensing MRI
    Sun, Liyan
    Fan, Zhiwen
    Ding, Xinghao
    Cai, Congbo
    Huang, Yue
    Paisley, John
    MAGNETIC RESONANCE IMAGING, 2019, 63 : 37 - 48
  • [29] A Divide-and-Conquer Approach for Large-scale Multi-label Learning
    Zhang, Wenjie
    Wang, Xiangfeng
    Yan, Junchi
    Zha, Hongyuan
    2017 IEEE THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2017), 2017, : 398 - 401
  • [30] RANDOMIZED DIVIDE-AND-CONQUER: IMPROVED PATH, MATCHING, AND PACKING ALGORITHMS
    Chen, Jianer
    Kneis, Joachim
    Lu, Songjian
    Moelle, Daniel
    Richter, Stefan
    Rossmanith, Peter
    Sze, Sing-Hoi
    Zhang, Fenghui
    SIAM JOURNAL ON COMPUTING, 2009, 38 (06) : 2526 - 2547