Scalable Data Exchange with Functional Dependencies

被引:22
|
作者
Marnette, Bruno [1 ,2 ]
Mecca, Giansalvatore [3 ]
Papotti, Paolo [4 ]
机构
[1] Oxford Univ Comp Lab, UK & INRIA Saclay, Oxford, France
[2] INRIA Saclay, Palaiseau, France
[3] Univ Basilicata, Dipt Matemat & Informat, Potenza, Italy
[4] Univ Roma Tre, Dipt Informat & Automaz, Rome, Italy
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2010年 / 3卷 / 01期
基金
英国工程与自然科学研究理事会; 欧洲研究理事会;
关键词
D O I
10.14778/1920841.1920859
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The recent literature has provided a solid theoretical foundation for the use of schema mappings in data-exchange applications. Following this formalization, new algorithms have been developed to generate optimal solutions for mapping scenarios in a highly scalable way, by relying on SQL. However, these algorithms suffer from a serious drawback: they are not able to handle key constraints and functional dependencies on the target, i.e., equality generating dependencies (egds). While egds play a crucial role in the generation of optimal solutions, handling them with first-order languages is a difficult problem. In fact, we start from a negative result: it is not always possible to compute solutions for scenarios with egds using an SQL script. Then, we identify many practical cases in which this is possible, and develop a best-effort algorithm to do this. Experimental results show that our algorithm produces solutions of better quality with respect to those produced by previous algorithms, and scales nicely to large databases.
引用
收藏
页码:105 / 116
页数:12
相关论文
共 50 条
  • [1] Functional Dependencies Unleashed for Scalable Data Exchange
    Bonifati, Angela
    Ileana, Ioana
    Linardi, Michele
    [J]. 28TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM) 2016), 2016,
  • [2] Scalable Functional Dependencies Discovery from Big Data
    Tu Shouzhong
    Huang Minlie
    [J]. 2016 IEEE SECOND INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2016, : 426 - 431
  • [3] An Efficient and Scalable Algorithm to Mine Functional Dependencies from Distributed Big Data
    Wu, Wanqing
    Mao, Wenyu
    [J]. SENSORS, 2022, 22 (10)
  • [4] Scalable query reformulation using views in the presence of functional dependencies
    Bai, QY
    Hong, J
    McTear, MF
    [J]. ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2003, 2762 : 471 - 482
  • [5] Mining functional dependencies from data
    Hong Yao
    Howard J. Hamilton
    [J]. Data Mining and Knowledge Discovery, 2008, 16 : 197 - 219
  • [6] Pattern Functional Dependencies for Data Cleaning
    Qahtan, Abdulhakim
    Tang, Nan
    Ouzzani, Mourad
    Cao, Yang
    Stonebraker, Michael
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2020, 13 (05): : 684 - 697
  • [7] On discovery of functional dependencies from data
    Liu, Jixue
    Ye, Feiyue
    Li, Jiuyong
    Wang, Junhu
    [J]. DATA & KNOWLEDGE ENGINEERING, 2013, 86 : 146 - 159
  • [8] Mining functional dependencies from data
    Yao, Hong
    Hamilton, Howard J.
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2008, 16 (02) : 197 - 219
  • [9] Conditional functional dependencies for data cleaning
    Bohannon, Philip
    Fan, Wenfei
    Geerts, Floris
    Jia, Xibei
    Kementsietsidis, Anastasios
    [J]. 2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 721 - 730
  • [10] Improving XML Data Quality with Functional Dependencies
    Tan, Zijing
    Zhang, Liyong
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT I, 2011, 6587 : 450 - 465