Data exchange: semantics and query answering

Cited by: 524
Authors
Fagin, R
Kolaitis, PG
Miller, RJ
Popa, L
Affiliations
[1] Univ Calif Santa Cruz, Santa Cruz, CA 95064 USA
[2] Univ Toronto, Toronto, ON, Canada
Keywords
data exchange; data integration; dependencies; universal solution; chase; query answering; certain answers; computational complexity; first-order inexpressibility
DOI
10.1016/j.tcs.2004.10.033
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Data exchange is the problem of taking data structured under a source schema and creating an instance of a target schema that reflects the source data as accurately as possible. In this paper, we address foundational and algorithmic issues related to the semantics of data exchange and to the query answering problem in the context of data exchange. These issues arise because, given a source instance, there may be many target instances that satisfy the constraints of the data exchange problem. We give an algebraic specification that selects, among all solutions to the data exchange problem, a special class of solutions that we call universal. We show that a universal solution has no more and no less data than required for data exchange and that it represents the entire space of possible solutions. We then identify fairly general, yet practical, conditions that guarantee the existence of a universal solution and yield algorithms to compute a canonical universal solution efficiently. We adopt the notion of the "certain answers" in indefinite databases for the semantics for query answering in data exchange. We investigate the computational complexity of computing the certain answers in this context and also address other algorithmic issues that arise in data exchange. In particular, we study the problem of computing the certain answers of target queries by simply evaluating them on a canonical universal solution, and we explore the boundary of what queries can and cannot be answered this way, in a data exchange setting. © 2004 Elsevier B.V. All rights reserved.
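The ideas named in the abstract (the chase, a canonical universal solution, certain answers) can be illustrated with a minimal Python sketch. It is not the paper's algorithm: it assumes only source-to-target tuple-generating dependencies (no target constraints) and conjunctive queries, and every name in it (`Null`, `matches`, `chase`, `certain_answers`, the `Emp`/`Works`/`Dept` relations, the `?x` variable convention) is hypothetical and chosen for illustration. Unknown values are modelled as labeled nulls, and the certain answers of a conjunctive query are obtained by evaluating it on the chased instance and discarding rows that contain nulls.

```python
from itertools import count


class Null:
    """A labeled null: a placeholder for a value the source does not determine."""

    def __init__(self, n):
        self.n = n

    def __repr__(self):
        return f"N{self.n}"


def is_var(term):
    # Assumed convention for this sketch: variables are strings starting with "?".
    return isinstance(term, str) and term.startswith("?")


def matches(atoms, instance, binding=None):
    """Yield every variable assignment that maps all atoms onto facts of instance."""
    binding = binding or {}
    if not atoms:
        yield dict(binding)
        return
    (rel, terms), rest = atoms[0], atoms[1:]
    for fact_rel, values in instance:
        if fact_rel != rel or len(values) != len(terms):
            continue
        new = dict(binding)
        ok = True
        for t, v in zip(terms, values):
            if is_var(t):
                if new.setdefault(t, v) != v:  # conflicts with an earlier binding
                    ok = False
                    break
            elif t != v:  # constant in the atom must equal the fact's value
                ok = False
                break
        if ok:
            yield from matches(rest, instance, new)


def chase(source, tgds):
    """Chase a source instance with source-to-target tgds given as (body, head) pairs."""
    target = set()
    fresh = count(1)
    for body, head in tgds:
        for h in list(matches(body, source)):
            # Fire the dependency only if the head is not already satisfied.
            if next(matches(head, target, h), None) is not None:
                continue
            ext = dict(h)
            for _, terms in head:
                for t in terms:
                    if is_var(t) and t not in ext:
                        ext[t] = Null(next(fresh))  # existential variable
            for rel, terms in head:
                target.add((rel, tuple(ext[t] if is_var(t) else t for t in terms)))
    return target


def certain_answers(query_body, answer_vars, solution):
    """Evaluate a conjunctive query on the solution and keep only null-free rows."""
    answers = set()
    for h in matches(query_body, solution):
        row = tuple(h[v] for v in answer_vars)
        if not any(isinstance(x, Null) for x in row):
            answers.add(row)
    return answers


# Hypothetical setting: Emp(n, d) -> Works(n, d) and Emp(n, d) -> exists m. Dept(d, m)
source = {("Emp", ("alice", "sales")), ("Emp", ("bob", "sales"))}
tgds = [
    ([("Emp", ("?n", "?d"))], [("Works", ("?n", "?d"))]),
    ([("Emp", ("?n", "?d"))], [("Dept", ("?d", "?m"))]),
]
solution = chase(source, tgds)
query = [("Works", ("?n", "?d")), ("Dept", ("?d", "?m"))]
print(certain_answers(query, ["?n"], solution))
print(certain_answers(query, ["?n", "?m"], solution))
```

In this toy setting the manager of the sales department is unknown, so the chase records it as a single labeled null. Asking only for employee names returns both employees, whereas asking for (employee, manager) pairs returns nothing, because no manager value holds in every possible solution. The abstract notes that the boundary of queries answerable this way is itself a subject of the paper, so the sketch should not be read as covering queries beyond the conjunctive case.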
Pages: 89-124
Number of pages: 36