Schema matching using interattribute dependencies

被引:14
|
作者
Kang, Jaewoo [1 ]
Naughton, Jeffrey F. [2 ]
机构
[1] Korea Univ, Coll Informat & Commun, Seoul 136705, South Korea
[2] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA
基金
美国国家科学基金会;
关键词
schema matching; attribute dependency; graph matching;
D O I
10.1109/TKDE.2008.100
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Schema matching is one of the key challenges in information integration. It is a labor-intensive and time-consuming process. To alleviate the problem, many automated solutions have been proposed. Most of the existing solutions mainly rely upon textual similarity of the data to be matched. However, there exist instances of the schema-matching problem for which they do not even apply. Such problem instances typically arise when the column names in the schemas and the data in the columns are opaque or very difficult to interpret. In our previous work, we proposed a two-step technique to address this problem. In the first step, we measure the dependencies between attributes within tables using an information-theoretic measure and construct a dependency graph for each table capturing the dependencies among attributes. In the second step, we find matching node pairs across the dependency graphs by running a graph-matching algorithm. In our previous work, we experimentally validated the accuracy of the approach. One remaining challenge is the computational complexity of the graph-matching problem in the second step. The problem instance we are facing is the weighted graph-matching problem to which no efficient solution has yet been found. In this paper, we extend the previous work by improving the second phase of the algorithm incorporating efficient approximation algorithms into the framework.
引用
收藏
页码:1393 / 1407
页数:15
相关论文
共 50 条
  • [1] A Schema Matching Method Based on Partial Functional Dependencies
    Li Guo-Hui
    Du Xiao-Kun
    Hu Fang-Xiao
    Du Jian-Qiang
    [J]. FCST: 2008 JAPAN-CHINA JOINT WORKSHOP ON FRONTIER OF COMPUTER SCIENCE AND TECHNOLOGY, PROCEEDINGS, 2008, : 131 - +
  • [2] Schema matching using duplicates
    Bilke, A
    Naumann, F
    [J]. ICDE 2005: 21ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2005, : 69 - 80
  • [3] Schema matching using directed graph matching
    Amshakala, K.
    Nedunchezhian, R.
    [J]. WSEAS Transactions on Computers, 2013, 12 (09): : 341 - 354
  • [4] Using linguistic techniques for schema matching
    Unal, Ozgul
    Afsarmanesh, Hamideh
    [J]. ICSOFT 2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON SOFTWARE AND DATA TECHNOLOGIES, VOL 2, 2006, : 115 - +
  • [5] Schema matching using neural network
    Li, Y
    Liu, DB
    Zhang, WM
    [J]. 2005 IEEE/WIC/ACM International Conference on Web Intelligence, Proceedings, 2005, : 743 - 746
  • [6] Automated database schema design using mined data dependencies
    Wong, SKM
    Butz, CJ
    Xiang, Y
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1998, 49 (05): : 455 - 470
  • [7] A Method for Complex Schema Matching Using Corpus
    Qian, Ying
    Li, Yu-Xiang
    Zhang, Shuai
    Cui, Li
    [J]. 2011 INTERNATIONAL CONFERENCE ON FUTURE COMPUTER SCIENCE AND APPLICATION (FCSA 2011), VOL 1, 2011, : 445 - 448
  • [8] Schema Normalization for Improving Schema Matching
    Sorrentino, Serena
    Bergamaschi, Sonia
    Gawinecki, Maciej
    Po, Laura
    [J]. CONCEPTUAL MODELING - ER 2009, PROCEEDINGS, 2009, 5829 : 280 - +
  • [9] On Defining Functional Dependencies in XML Schema
    Chen, Haitao
    Liao, Husheng
    Gao, Zengqi
    [J]. DATABASE THEORY AND APPLICATION, BIO-SCIENCE AND BIO-TECHNOLOGY, 2010, 118 : 120 - 131
  • [10] The Detection of Rectangular Shape Objects Using Matching Schema
    Ye, Soo-Young
    Choi, Joon-Young
    Nam, Ki-Gon
    [J]. TRANSACTIONS ON ELECTRICAL AND ELECTRONIC MATERIALS, 2016, 17 (06) : 363 - 368