Matcher Composition Methods for Automatic Schema Matching

Cited by: 5
Authors
Nikovski, Daniel [1]
Esenther, Alan [1]
Ye, Xiang [1]
Shiba, Mitsuteru [2]
Takayama, Shigenobu [3]
Affiliations
[1] Mitsubishi Elect Res Labs, 201 Broadway, Cambridge, MA 02139 USA
[2] Mitsubishi Electr Corp, Kanagawa 2478501, Japan
[3] Mitsubishi Elect Informat Syst Cor, Kanagawa 2470065, Japan
Source
ENTERPRISE INFORMATION SYSTEMS, ICEIS 2012 | 2013 / Vol. 141
Keywords
Data integration; Virtual databases; Uncertain schema matching; NETWORKS;
DOI
10.1007/978-3-642-40654-6_7
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
We address the problem of automating the process of deciding whether two data schema elements match (that is, refer to the same actual object or concept), and propose several methods for combining evidence computed by multiple basic matchers. One class of methods uses Bayesian networks to account for the conditional dependency between the similarity values produced by individual matchers that use the same or similar information, so as to avoid overconfidence in match probability estimates and improve the accuracy of matching. Another class of methods relies on optimization switches that mitigate this dependency in a domain-independent manner. Experimental results under several testing protocols suggest that the matching accuracy of the Bayesian composite matchers can significantly exceed that of the individual component matchers, and the careful selection of optimization switches can improve matching accuracy even further.
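The overconfidence that the abstract attributes to dependent matchers can be illustrated with a small calculation. The sketch below is not the authors' Bayesian network or their optimization switches. It is a minimal naive-Bayes combination, written under the assumption that each matcher contributes a likelihood ratio P(score | match) / P(score | no match). The function name, the prior of 0.1, and the likelihood ratio of 4.0 are all hypothetical values chosen for illustration.

```python
def posterior_from_likelihood_ratios(prior, likelihood_ratios):
    """Combine a prior match probability with per-matcher likelihood ratios,
    treating the matchers as conditionally independent (naive Bayes).

    Hypothetical helper for illustration; not taken from the paper.
    """
    odds = prior / (1.0 - prior)      # convert probability to odds
    for lr in likelihood_ratios:
        odds *= lr                    # each matcher multiplies the odds
    return odds / (1.0 + odds)        # convert odds back to probability


prior = 0.1   # assumed prior probability that two schema elements match
lr = 4.0      # assumed likelihood ratio reported by one matcher

# One matcher's evidence, counted once.
single = posterior_from_likelihood_ratios(prior, [lr])

# A second matcher that uses the same information adds nothing new, but the
# independence assumption counts its evidence again and inflates the estimate.
naive_double = posterior_from_likelihood_ratios(prior, [lr, lr])

print(f"evidence counted once:  {single:.3f}")
print(f"evidence counted twice: {naive_double:.3f}")
```

If the second matcher is fully redundant with the first, the estimate that counts the evidence once is the appropriate one, and the gap between the two printed values is the overconfidence introduced by ignoring the dependency. Modeling that dependency explicitly, for instance with a Bayesian network as the abstract describes, is one way to close the gap.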
Pages: 108-123
Page count: 16
Related papers
50 items in total
  • [11] Managing uncertainty in schema matcher ensembles
    Marie, Anan
    Gal, Avigdor
    SCALABLE UNCERTAINTY MANAGEMENT, PROCEEDINGS, 2007, 4772 : 60 - +
  • [12] Ontology-aided automatic schema matching
    Technology Center of Software Engineering, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China
    Unknown
    Ruan Jian Xue Bao, 2009, 2 (234-245):
  • [13] Poster session: An indexing structure for automatic schema matching
    Duchateau, Fabien
    Bellahsene, Zohra
    Roantree, Mark
    Roche, Mathieu
    2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, VOLS 1-2, 2007, : 485 - +
  • [14] Automatic generation of probabilistic relationships for improving schema matching
    Po, Laura
    Sorrentino, Serena
    INFORMATION SYSTEMS, 2011, 36 (02) : 192 - 208
  • [15] A Semi-automatic Approach to Reduce Uncertainty of Schema Matching
    Xie, Gang
    Lan, Yuqing
    2016 3RD INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2016, : 95 - 98
  • [16] An Automatic Matcher and Linker for Transportation Datasets
    Masri, Ali
    Zeitouni, Karine
    Kedad, Zoubida
    Leroy, Bertrand
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2017, 6 (01)
  • [17] Automatic Schema-Independent Linked Data Instance Matching System
    Khai Nguyen
    Ichise, Ryutaro
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2017, 13 (01) : 82 - 103
  • [18] An unsupervised instance matcher for schema-free RDF data
    Kejriwal, Mayank
    Miranker, Daniel P.
    JOURNAL OF WEB SEMANTICS, 2015, 35 : 102 - 123
  • [19] EXSMAL: EDI/XML semi-automatic schema matching algorithm
    Chukmol, U
    Rifaieh, R
    Benharkat, NA
    CEC 2005: Seventh IEEE International Conference on E-Commerce Technology, Proceedings, 2005, : 422 - 425
  • [20] Schema Normalization for Improving Schema Matching
    Sorrentino, Serena
    Bergamaschi, Sonia
    Gawinecki, Maciej
    Po, Laura
    CONCEPTUAL MODELING - ER 2009, PROCEEDINGS, 2009, 5829 : 280 - +