Scalable Data Exchange with Functional Dependencies

被引:22
|
作者
Marnette, Bruno [1 ,2 ]
Mecca, Giansalvatore [3 ]
Papotti, Paolo [4 ]
机构
[1] Oxford Univ Comp Lab, UK & INRIA Saclay, Oxford, France
[2] INRIA Saclay, Palaiseau, France
[3] Univ Basilicata, Dipt Matemat & Informat, Potenza, Italy
[4] Univ Roma Tre, Dipt Informat & Automaz, Rome, Italy
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2010年 / 3卷 / 01期
基金
英国工程与自然科学研究理事会; 欧洲研究理事会;
关键词
D O I
10.14778/1920841.1920859
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The recent literature has provided a solid theoretical foundation for the use of schema mappings in data-exchange applications. Following this formalization, new algorithms have been developed to generate optimal solutions for mapping scenarios in a highly scalable way, by relying on SQL. However, these algorithms suffer from a serious drawback: they are not able to handle key constraints and functional dependencies on the target, i.e., equality generating dependencies (egds). While egds play a crucial role in the generation of optimal solutions, handling them with first-order languages is a difficult problem. In fact, we start from a negative result: it is not always possible to compute solutions for scenarios with egds using an SQL script. Then, we identify many practical cases in which this is possible, and develop a best-effort algorithm to do this. Experimental results show that our algorithm produces solutions of better quality with respect to those produced by previous algorithms, and scales nicely to large databases.
引用
收藏
页码:105 / 116
页数:12
相关论文
共 50 条
  • [41] Scalable Communication Protocols for Dynamic Sparse Data Exchange
    Hoefler, Torsten
    Siebert, Christian
    Lumsdaine, Andrew
    [J]. PPOPP 2010: PROCEEDINGS OF THE 2010 ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, 2010, : 159 - 168
  • [42] Semandaq: A Data Quality System Based on Conditional Functional Dependencies
    Fan, Wenfei
    Geerts, Floris
    Jia, Xibei
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (02): : 1460 - 1463
  • [43] Spatio-Temporal Functional Dependencies for Sensor Data Streams
    Charfi, Manel
    Gripay, Yann
    Petit, Jean-Marc
    [J]. ADVANCES IN SPATIAL AND TEMPORAL DATABASES, SSTD 2017, 2017, 10411 : 182 - 199
  • [44] Automated Inference with Fuzzy Functional Dependencies over Graded Data
    Manuel Rodriguez-Jimenez, Jose
    Cordero, Pablo
    Enciso, Manuel
    Mora, Angel
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, PT II, 2013, 7903 : 254 - 265
  • [45] Discovering Functional Dependencies from Mixed-Type Data
    Mandros, Panagiotis
    Kaltenpoth, David
    Boley, Mario
    Vreeken, Jilles
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1404 - 1414
  • [46] Bayes Performance of Batch Data Mining Based on Functional Dependencies
    Xi, Haixu
    Ye, Feiyue
    He, Sheng
    Liu, Yijun
    Jiang, Hongfen
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (03)
  • [47] FOX: Inference of approximate functional dependencies from XML data
    Fassetti, Fabio
    Fazzinga, Bettina
    [J]. DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 10 - +
  • [48] Privacy-Preserving Publishing Data with Full Functional Dependencies
    Wang, Hui
    Liu, Ruilin
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT II, PROCEEDINGS, 2010, 5982 : 176 - 183
  • [49] Functional dependencies and the design of relational databases involving imprecise data
    Bosc, P
    Liétard, L
    [J]. INFORMATION, UNCERTAINTY AND FUSION, 2000, 516 : 45 - 56
  • [50] Fuzzy data dependencies and implication of fuzzy data dependencies
    Liu, WY
    [J]. FUZZY SETS AND SYSTEMS, 1997, 92 (03) : 341 - 348