Matching Attributes Across Overlapping Heterogeneous Data Sources Using Mutual Information

被引:3
|
作者
Zhao, Huimin [1 ]
机构
[1] Univ Wisconsin Milwaukee, Sheldon B Lubar Sch Business, Milwaukee, WI 53201 USA
关键词
Attribute Correspondence; Attribute Matching; Heterogeneous Databases; Information Theory; Mutual Information; SEMANTIC-INTEGRATION; SCHEMA; CORRESPONDENCES; RETRIEVAL; DATABASES;
D O I
10.4018/jdm.2010100105
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Identifying matching attributes across heterogeneous data sources is a critical and time-consuming step in integrating the data sources. In this paper, the author proposes a method for matching the most frequently encountered types of attributes across overlapping heterogeneous data sources. The author uses mutual information as a unified measure of dependence on various types of attributes. An example is used to demonstrate the utility of the proposed method, which is useful in developing practical attribute matching tools.
引用
收藏
页码:91 / 110
页数:20
相关论文
共 50 条
  • [1] Semantic matching across heterogeneous data sources
    Zhao, Huimin
    COMMUNICATIONS OF THE ACM, 2007, 50 (01) : 45 - 50
  • [2] Entity Matching across Heterogeneous Sources
    Yang, Yang
    Sun, Yizhou
    Tang, Jie
    Ma, Bo
    Li, Juanzi
    KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 1395 - 1404
  • [3] High-performance spatiotemporal trajectory matching across heterogeneous data sources
    Gong, Xuri
    Huang, Zhou
    Wang, Yaoli
    Wu, Lun
    Liu, Yu
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 105 : 148 - 161
  • [4] Entity matching across heterogeneous data sources: An approach based on constrained cascade generalization
    Zhao, Huimin
    Ram, Sudha
    DATA & KNOWLEDGE ENGINEERING, 2008, 66 (03) : 368 - 381
  • [5] Information sharing among multiple heterogeneous data sources distributed across the Internet
    Ram, S
    Ramesh, V
    PROCEEDINGS OF THE THIRTY-FIRST HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, VOL IV: INTERNET AND THE DIGITAL ECONOMY TRACT, 1998, : 504 - 504
  • [6] Exploring attribute correspondences across heterogeneous databases by mutual information
    Zhao, HM
    Soofi, ES
    JOURNAL OF MANAGEMENT INFORMATION SYSTEMS, 2006, 22 (04) : 305 - 336
  • [7] Matching point features using mutual information
    Rangarajan, A
    Duncan, JS
    WORKSHOP ON BIOMEDICAL IMAGE ANALYSIS, PROCEEDINGS, 1998, : 172 - 181
  • [8] Boolean query mapping across heterogeneous information sources
    Stanford Univ, Stanford, United States
    IEEE Trans Knowl Data Eng, 4 (515-521):
  • [9] Boolean query mapping across heterogeneous information sources
    Chang, KCC
    GarciaMolina, H
    Paepcke, A
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1996, 8 (04) : 515 - 521
  • [10] DEM: Deep Entity Matching Across Heterogeneous Information Networks
    Kong, Chao
    Chen, Bao-Xiang
    Zhang, Li-Ping
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2020, 35 (04) : 739 - 750