Matching Attributes Across Overlapping Heterogeneous Data Sources Using Mutual Information

Cited by: 3
Authors
Zhao, Huimin [1]
Affiliations
[1] Univ Wisconsin Milwaukee, Sheldon B Lubar Sch Business, Milwaukee, WI 53201 USA
Keywords
Attribute Correspondence; Attribute Matching; Heterogeneous Databases; Information Theory; Mutual Information; SEMANTIC-INTEGRATION; SCHEMA; CORRESPONDENCES; RETRIEVAL; DATABASES
DOI
10.4018/jdm.2010100105
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Identifying matching attributes across heterogeneous data sources is a critical and time-consuming step in integrating the data sources. In this paper, the author proposes a method for matching the most frequently encountered types of attributes across overlapping heterogeneous data sources. The author uses mutual information as a unified measure of dependence on various types of attributes. An example is used to demonstrate the utility of the proposed method, which is useful in developing practical attribute matching tools.
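The abstract does not spell out the computation, so the following is only a minimal sketch of the general idea, not the author's method. It assumes the two sources overlap, so that records describing the same entity can be paired up, and it estimates the empirical mutual information between one discrete attribute from each source. The function name `mutual_information` and the sample attribute values are hypothetical.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Empirical mutual information, in bits, between two discrete sequences.

    xs[i] and ys[i] are assumed to be the values of two attributes for the
    same real-world entity, one taken from each data source.
    """
    if len(xs) != len(ys):
        raise ValueError("sequences must have the same length")
    n = len(xs)
    if n == 0:
        return 0.0
    count_x = Counter(xs)            # marginal counts for the first attribute
    count_y = Counter(ys)            # marginal counts for the second attribute
    count_xy = Counter(zip(xs, ys))  # joint counts over paired records
    mi = 0.0
    for (x, y), c in count_xy.items():
        # p(x, y) * log2( p(x, y) / (p(x) * p(y)) ), written with raw counts
        mi += (c / n) * log2(c * n / (count_x[x] * count_y[y]))
    return mi


# Hypothetical paired records: differently coded but equivalent attributes.
gender = ["M", "F", "M", "F"]
sex_code = [1, 0, 1, 0]
region = ["a", "a", "b", "b"]

dependent_score = mutual_information(gender, sex_code)
independent_score = mutual_information(gender, region)
```

In this toy data the differently coded but equivalent attributes score higher than the unrelated pair, which is the kind of signal an attribute matcher could rank candidate pairs by. Continuous or textual attributes would need discretization or a different estimator, which this sketch does not cover.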
Pages: 91-110
Page count: 20
Related papers
50 in total
  • [31] An approach for integrating heterogeneous information sources in a medical data warehouse
    Kerkri E.M.
    Quantin C.
    Allaert F.A.
    Cottin Y.
    Charve P.
    Jouanot F.
    Yétongnon K.
    Journal of Medical Systems, 2001, 25 (3) : 167 - 176
  • [32] Combining schema and instance information for integrating heterogeneous data sources
    Zhao, Huimin
    Ram, Sudha
    DATA & KNOWLEDGE ENGINEERING, 2007, 61 (02) : 281 - 303
  • [33] Multi-spectral stereo image matching using mutual information
    Fookes, C
    Maeder, A
    Sridharan, S
    Cook, J
    2ND INTERNATIONAL SYMPOSIUM ON 3D DATA PROCESSING, VISUALIZATION, AND TRANSMISSION, PROCEEDINGS, 2004, : 961 - 968
  • [34] Automatic User Identification Method across Heterogeneous Mobility Data Sources
    Cao, Wei
    Wu, Zhengwei
    Wang, Dong
    Li, Jian
    Wu, Haishan
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 978 - 989
  • [35] Efficient top-κ search across heterogeneous XML data sources
    Li, Jianxin
    Liu, Chengfei
    Yu, Jeffrey Xu
    Zhou, Rui
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2008, 4947 : 314 - +
  • [36] Matching heterogeneous textual data using spatial features
    Fize, Jacques
    Roche, Mathieu
    Teisseire, Maguelonne
    2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, : 1389 - 1396
  • [37] Multi-Aspect Candidates for Repositioning: Data Fusion Methods Using Heterogeneous Information Sources
    Arany, A.
    Bolgar, B.
    Balogh, B.
    Antal, P.
    Matyus, P.
    CURRENT MEDICINAL CHEMISTRY, 2013, 20 (01) : 95 - 107
  • [38] Data Integration of Heterogeneous Data Sources Using QR Decomposition
    Sandhya, Harikumar
    Roy, Mekha Meriam
    INTELLIGENT SYSTEMS TECHNOLOGIES AND APPLICATIONS, VOL 2, 2016, 385 : 333 - 344
  • [39] Propensity score stratified MAP prior and posterior inference for incorporating information across multiple potentially heterogeneous data sources
    Zhu, Angela Yaqian
    Roy, Dooti
    Zhu, Zheng
    Sailer, Martin Oliver
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2024, 34 (02) : 190 - 204
  • [40] Analysis and Visualization of Seismic Data Using Mutual Information
    Tenreiro Machado, Jose A.
    Lopes, Antonio M.
    ENTROPY, 2013, 15 (09) : 3892 - 3909