Data exchange: semantics and query answering

被引:524
|
作者
Fagin, R
Kolaitis, PG
Miller, RJ
Popa, L
机构
[1] Univ Calif Santa Cruz, Santa Cruz, CA 95064 USA
[2] Univ Toronto, Toronto, ON, Canada
关键词
data exchange; data integration; dependencies; universal solution; chase; query answering; certain answers; computational complexity; first-order inexpressibility;
D O I
10.1016/j.tcs.2004.10.033
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Data exchange is the problem of taking data structured under a source schema and creating an instance of a target schema that reflects the source data as accurately as possible. In this paper, we address foundational and algorithmic issues related to the semantics of data exchange and to the query answering problem in the context of data exchange. These issues arise because, given a source instance, there may be many target instances that satisfy the constraints of the data exchange problem. We give an algebraic specification that selects, among all solutions to the data exchange problem, a special class of solutions that we call universal. We show that a universal solution has no more and no less data than required for data exchange and that it represents the entire space of possible solutions. We then identify fairly general, yet practical, conditions that guarantee the existence of a universal solution and yield algorithms to compute a canonical universal solution efficiently. We adopt the notion of the "certain answers" in indefinite databases for the semantics for query answering in data exchange. We investigate the computational complexity of computing the certain answers in this context and also address other algorithmic issues that arise in data exchange. In particular, we study the problem of computing the certain answers of target queries by simply evaluating them on a canonical universal solution, and we explore the boundary of what queries can and cannot be answered this way, in a data exchange setting. 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:89 / 124
页数:36
相关论文
共 50 条
  • [31] Data Mining for XML Query-Answering Support
    Mazuran, Mirjana
    Quintarelli, Elisa
    Tanca, Letizia
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (08) : 1393 - 1407
  • [32] Linguistic Query Answering on Data Cubes with Time Dimension
    Castillo-Ortega, Rita
    Marin, Nicolas
    Sanchez, Daniel
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2011, 26 (10) : 1002 - 1021
  • [33] Approximate query answering using data warehouse striping
    Bernardino, JR
    Furtado, PS
    Madeira, HC
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2002, 19 (02) : 145 - 167
  • [34] A Semi-automatic Data Generator for Query Answering
    Angiulli, Fabrizio
    Del Prete, Alessandra
    Fassetti, Fabio
    Nistico, Simona
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2022), 2022, 13515 : 106 - 114
  • [35] An efficient consistent query answering method for data integration
    Zhang, Xiao-Gang
    Yang, Lu-Ming
    Pan, Jiu-Hui
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2014, 42 (08): : 1474 - 1479
  • [36] Approximate Query Answering Using Data Warehouse Striping
    Jorge R. Bernardino
    Pedro S. Furtado
    Henrique C. Madeira
    Journal of Intelligent Information Systems, 2002, 19 : 145 - 167
  • [37] Query answering with transitive and linear-ordered data
    1600, AI Access Foundation (63):
  • [38] A graph-based approach for query answering under inconsistency-tolerant semantics
    Fu X.
    Qi G.
    Zhang Y.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2016, 53 (02): : 303 - 315
  • [39] Query Similarity for Approximate Query Answering
    Kantere, Verena
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2016, PT II, 2016, 9828 : 355 - 367
  • [40] Query answering in circumscription
    Helft, Nicolas
    Inoue, Katsumi
    Poole, David
    1600, Morgan Kaufmann Publ Inc, San Mateo, CA, USA (01):