Normalization and optimization of schema mappings

被引:12
|
作者
Gottlob, Georg [2 ]
Pichler, Reinhard [1 ]
Savenkov, Vadim [1 ]
机构
[1] Vienna Univ Technol, Database & Artificial Intelligence Grp, Inst Informat Syst, A-1040 Vienna, Austria
[2] Univ Oxford, Comp Lab, Oxford OX1 3QD, England
来源
VLDB JOURNAL | 2011年 / 20卷 / 02期
关键词
Data integration; Data exchange; Schema mappings optimization; DATA EXCHANGE;
D O I
10.1007/s00778-011-0226-x
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Schema mappings are high-level specifications that describe the relationship between database schemas. They are an important tool in several areas of database research, notably in data integration and data exchange. However, a concrete theory of schema mapping optimization including the formulation of optimality criteria and the construction of algorithms for computing optimal schema mappings is completely lacking to date. The goal of this work is to fill this gap. We start by presenting a system of rewrite rules to minimize sets of source-to-target tuple-generating dependencies. Moreover, we show that the result of this minimization is unique up to variable renaming. Hence, our optimization also yields a schema mapping normalization. By appropriately extending our rewrite rule system, we also provide a normalization of schema mappings containing equality-generating target dependencies. An important application of such a normalization is in the area of defining the semantics of query answering in data exchange, since several definitions in this area depend on the concrete syntactic representation of the mappings. This is, in particular, the case for queries with negated atoms and for aggregate queries. The normalization of schema mappings allows us to eliminate the effect of the concrete syntactic representation of the mapping from the semantics of query answering. We discuss in detail how our results can be fruitfully applied to aggregate queries.
引用
下载
收藏
页码:277 / 302
页数:26
相关论文
共 50 条
  • [31] MapMerge: correlating independent schema mappings
    Alexe, Bogdan
    Hernandez, Mauricio
    Popa, Lucian
    Tan, Wang-Chiew
    VLDB JOURNAL, 2012, 21 (02): : 191 - 211
  • [32] Schema Merging Based on Semantic Mappings
    Rizopoulos, Nikos
    McBrien, Peter
    DATASPACE: THE FINAL FRONTIER, PROCEEDINGS, 2009, 5588 : 193 - 198
  • [33] A UML profile for modeling schema mappings
    Kurz, Stefan
    Guppenberger, Michael
    Freitag, Burkhard
    ADVANCES IN CONCEPTUAL MODELING - THEORY AND PRACTICE, PROCEEDINGS, 2006, 4231 : 53 - +
  • [34] Managing uncertainty in schema matching with top-K schema mappings
    Gal, Avigdor
    JOURNAL ON DATA SEMANTICS VI, 2006, 4090 : 90 - 114
  • [35] Schema integration based on uncertain semantic mappings
    Magnani, M
    Rizopoulos, N
    McBrien, P
    Montesi, D
    CONCEPTUAL MODELING - ER 2005, 2005, 3716 : 31 - 46
  • [36] Executable schema mappings for statistical data processing
    Atzeni, Paolo
    Bellomarini, Luigi
    Bugiotti, Francesca
    De Leonardis, Marco
    DISTRIBUTED AND PARALLEL DATABASES, 2018, 36 (02) : 265 - 300
  • [37] Characterizing Schema Mappings via Data Examples
    Alexe, Bogdan
    Kolaitis, Phokion G.
    Tan, Wang-Chiew
    PODS 2010: PROCEEDINGS OF THE TWENTY-NINTH ACM SIGMOD-SIGACT-SIGART SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2010, : 261 - 271
  • [38] XML schema mappings for heterogeneous database access
    Collins, SR
    Navathe, S
    Mark, L
    INFORMATION AND SOFTWARE TECHNOLOGY, 2002, 44 (04) : 251 - 257
  • [39] Generic schema mappings for composition and query answering
    Kensche, David
    Quix, Christoph
    Xiang Li
    Yong Li
    Jarke, Matthias
    DATA & KNOWLEDGE ENGINEERING, 2009, 68 (07) : 599 - 621
  • [40] Executable schema mappings for statistical data processing
    Paolo Atzeni
    Luigi Bellomarini
    Francesca Bugiotti
    Marco De Leonardis
    Distributed and Parallel Databases, 2018, 36 : 265 - 300