A research agenda for query processing in large-scale Peer Data Management Systems

被引:8
|
作者
Hose, Katja [1 ]
Roth, Armin [2 ]
Zeitz, Andre [3 ]
Sattler, Kai-Uwe [1 ]
Naumann, Felix [2 ]
机构
[1] Tech Univ Ilmenau, FG Datenbanken & Informat Syst, D-98684 Ilmenau, Germany
[2] HPI Softwaresyst Tech, D-14482 Potsdam, Germany
[3] Univ Rostock, Univ Rechenzentrum Lehrstuhl Datenbank & Informat, D-18051 Rostock, Germany
关键词
query processing; large-scale data management; Peer Data Management Systems; PDMS;
D O I
10.1016/j.is.2008.01.012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Peer Data Management Systems (PDMS) are a novel, useful, but challenging paradigm for distributed data management and query processing. Conventional integrated information systems have a hierarchical structure with an integration component that manages a global schema and distributes queries against this schema to the underlying data sources. PDMS are a natural extension to this architecture by allowing each participating system (peer) to act both as a data source and as an integrator. Peers are interconnected by schema mappings, which guide the rewriting of queries between the heterogeneous schemas, and thus form a P2P (peer-to-peer)-like network. Despite several years of research, the development of efficient PDMS Still holds many challenges. In this article we first survey the state of the art on peer data management: We classify PDms by characteristics concerning their system model, their semantics, their query planning schemes, and their maintenance. Then we systematically examine open research directions in each of those areas. In particular, we observe that research results from both the domain of P2P systems and of conventional distributed data management can have an impact on the development of PDms. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:597 / 610
页数:14
相关论文
共 50 条