Matching Attributes Across Overlapping Heterogeneous Data Sources Using Mutual Information

Cited by: 3
Author
Zhao, Huimin [1]
Affiliation
[1] Univ Wisconsin Milwaukee, Sheldon B Lubar Sch Business, Milwaukee, WI 53201 USA
Keywords
Attribute Correspondence; Attribute Matching; Heterogeneous Databases; Information Theory; Mutual Information; SEMANTIC-INTEGRATION; SCHEMA; CORRESPONDENCES; RETRIEVAL; DATABASES
DOI
10.4018/jdm.2010100105
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Identifying matching attributes across heterogeneous data sources is a critical and time-consuming step in integrating the data sources. In this paper, the author proposes a method for matching the most frequently encountered types of attributes across overlapping heterogeneous data sources. The author uses mutual information as a unified measure of dependence on various types of attributes. An example is used to demonstrate the utility of the proposed method, which is useful in developing practical attribute matching tools.
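To illustrate the general idea of using mutual information as a dependence measure between attributes, here is a minimal sketch. It is not the author's procedure: it assumes the overlapping records have already been linked across the two sources, that the attributes are discrete, and that a plain empirical estimate is adequate. The function name `mutual_information` and the sample columns (`gender`, `sex_code`, `region`) are hypothetical.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Empirical mutual information, in bits, between two paired sequences.

    xs[i] and ys[i] are assumed to describe the same real-world entity,
    i.e. the records of the two sources have already been linked.
    """
    if len(xs) != len(ys):
        raise ValueError("sequences must have the same length")
    n = len(xs)
    if n == 0:
        return 0.0
    joint = Counter(zip(xs, ys))  # counts of co-occurring value pairs
    px = Counter(xs)              # marginal counts for the first attribute
    py = Counter(ys)              # marginal counts for the second attribute
    mi = 0.0
    for (x, y), c in joint.items():
        # p(x,y) * log2( p(x,y) / (p(x) * p(y)) ), written with raw counts
        mi += (c / n) * log2(c * n / (px[x] * py[y]))
    return mi


# Hypothetical linked records: one attribute from source A, two from source B.
gender = ["M", "F", "F", "M"]    # source A
sex_code = [1, 0, 0, 1]          # source B, same concept with a different coding
region = ["N", "N", "S", "S"]    # source B, unrelated concept

score_match = mutual_information(gender, sex_code)
score_other = mutual_information(gender, region)
```

In this toy data `sex_code` is a one-to-one recoding of `gender`, so it scores higher than `region`, which is independent of `gender` in the sample. Because the measure depends only on how values co-occur, it does not require the two sources to share names or encodings.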
Pages: 91-110
Number of pages: 20