Scalable Data Exchange with Functional Dependencies

被引:22
|
作者
Marnette, Bruno [1 ,2 ]
Mecca, Giansalvatore [3 ]
Papotti, Paolo [4 ]
机构
[1] Oxford Univ Comp Lab, UK & INRIA Saclay, Oxford, France
[2] INRIA Saclay, Palaiseau, France
[3] Univ Basilicata, Dipt Matemat & Informat, Potenza, Italy
[4] Univ Roma Tre, Dipt Informat & Automaz, Rome, Italy
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2010年 / 3卷 / 01期
基金
英国工程与自然科学研究理事会; 欧洲研究理事会;
关键词
D O I
10.14778/1920841.1920859
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The recent literature has provided a solid theoretical foundation for the use of schema mappings in data-exchange applications. Following this formalization, new algorithms have been developed to generate optimal solutions for mapping scenarios in a highly scalable way, by relying on SQL. However, these algorithms suffer from a serious drawback: they are not able to handle key constraints and functional dependencies on the target, i.e., equality generating dependencies (egds). While egds play a crucial role in the generation of optimal solutions, handling them with first-order languages is a difficult problem. In fact, we start from a negative result: it is not always possible to compute solutions for scenarios with egds using an SQL script. Then, we identify many practical cases in which this is possible, and develop a best-effort algorithm to do this. Experimental results show that our algorithm produces solutions of better quality with respect to those produced by previous algorithms, and scales nicely to large databases.
引用
收藏
页码:105 / 116
页数:12
相关论文
共 50 条
  • [21] RECOGNITION OF FUNCTIONAL DEPENDENCIES USING METEOROLOGICAL DATA
    VAPNIK, VN
    ROMANOV, LN
    [J]. IZVESTIYA AKADEMII NAUK SSSR FIZIKA ATMOSFERY I OKEANA, 1978, 14 (02): : 131 - 137
  • [23] Mining relaxed functional dependencies from data
    Caruccio, Loredana
    Deufemia, Vincenzo
    Polese, Giuseppe
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 34 (02) : 443 - 477
  • [24] Mining relaxed functional dependencies from data
    Loredana Caruccio
    Vincenzo Deufemia
    Giuseppe Polese
    [J]. Data Mining and Knowledge Discovery, 2020, 34 : 443 - 477
  • [25] Threshold Functional Dependencies for Time Series Data
    Ji, Mingyue
    Wei, Xiukun
    Miao, Dongjing
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2020, 2020, 12115 : 164 - 174
  • [26] Elaboration on functional dependencies: Functional dependencies are dead, long live functional dependencies!
    KU Leuven, Belgium
    [J]. ACM SIGPLAN Not., 10 (133-147):
  • [27] Elaboration on Functional Dependencies: Functional Dependencies Are Dead, Long Live Functional Dependencies!
    Karachalias, Georgios
    Schrijvers, Tom
    [J]. ACM SIGPLAN NOTICES, 2017, 52 (10) : 133 - 147
  • [28] Scalable genomic data exchange and analytics with sBeacon
    Wickramarachchi, Anuradha
    Hosking, Brendan
    Jain, Yatish
    Grimes, John
    O'Brien, Mitchell J.
    Wright, Tracey
    Burgess, Mark A.
    Lin, Victor San Kho
    Reisinger, Florian
    Hofmann, Oliver
    Lawley, Michael
    Wilson, Laurence O. W.
    Twine, Natalie A.
    Bauer, Denis C.
    [J]. NATURE BIOTECHNOLOGY, 2023, 41 (11) : 1510 - 1512
  • [29] SEDEX: Scalable Entity Preserving Data Exchange
    Sekhavat, Yoones A.
    Parsons, Jeffrey
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (07) : 1878 - 1890
  • [30] Scalable genomic data exchange and analytics with sBeacon
    Anuradha Wickramarachchi
    Brendan Hosking
    Yatish Jain
    John Grimes
    Mitchell J. O’Brien
    Tracey Wright
    Mark A. Burgess
    Victor San Kho Lin
    Florian Reisinger
    Oliver Hofmann
    Michael Lawley
    Laurence O. W. Wilson
    Natalie A. Twine
    Denis C. Bauer
    [J]. Nature Biotechnology, 2023, 41 : 1510 - 1512