Efficient retrieval of replicated data

被引:3
|
作者
Tosun, Ali Saman [1 ]
机构
[1] Univ Texas, Dept Comp Sci, San Antonio, TX 78249 USA
关键词
declustering; parallel I/O; design theory;
D O I
10.1007/s10619-006-8484-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Declustering is a common technique used to reduce query response times. Data is declustered over multiple disks and query retrieval can be parallelized. Most of the research on declustering is targeted at spatial range queries and investigates schemes with low additive error. Recently, declustering using replication has been proposed to reduce the additive overhead. Replication significantly reduces retrieval cost of arbitrary queries. In this paper, we propose a disk allocation and retrieval mechanism for arbitrary queries based on design theory. Using the proposed c-copy replicated declustering scheme, (c - 1)k(2) + ck buckets can be retrieved using at most k disk accesses. Retrieval algorithm is very efficient and is asymptotically optimal with Theta(\Q\) complexity for a query Q. In addition to the deterministic worst-case bound and efficient retrieval, proposed algorithm handles nonuniform data, high dimensions, supports incremental declustering and has good fault-tolerance property. Experimental results show the feasibility of the algorithm.
引用
收藏
页码:107 / 124
页数:18
相关论文
共 50 条
  • [1] Efficient retrieval of replicated data
    Ali Şaman Tosun
    [J]. Distributed and Parallel Databases, 2006, 19 : 107 - 124
  • [2] EFFICIENT MANAGEMENT OF REPLICATED DATA
    PARIS, JF
    [J]. LECTURE NOTES IN COMPUTER SCIENCE, 1988, 326 : 396 - 409
  • [3] Efficient synchronization of replicated data in distributed systems
    Schütt, T
    Schintke, F
    Reinefeld, A
    [J]. COMPUTATIONAL SCIENCE - ICCS 2003, PT I, PROCEEDINGS, 2003, 2657 : 274 - 283
  • [4] Using reconfiguration for efficient management of replicated data
    Agrawal, D
    ElAbbadi, A
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1996, 8 (05) : 786 - 801
  • [5] Efficient, Chunk-Replicated Node Partitioned Data Warehouses
    Furtado, Pedro
    [J]. PROCEEDINGS OF THE 2008 INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS, 2008, : 578 - 583
  • [6] Binary vote assignment on a grid for efficient access of replicated data
    Deris, MM
    Evans, DJ
    Saman, MY
    Noraziah, A
    [J]. INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2003, 80 (12) : 1489 - 1498
  • [7] Efficient reuse of replicated parallel data segments in computational grids
    Tikar, Sandip
    Vadhiyar, Sathish
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2008, 24 (07): : 644 - 657
  • [8] AN EFFICIENT, FAULT-TOLERANT PROTOCOL FOR REPLICATED DATA MANAGEMENT
    SKEEN, D
    ELABBADI, A
    CRISTIAN, F
    [J]. LECTURE NOTES IN COMPUTER SCIENCE, 1990, 448 : 171 - 191
  • [9] Generalized Optimal Response Time Retrieval of Replicated Data from Storage Arrays
    Altiparmak, Nihat
    Tosun, Ali Saman
    [J]. ACM TRANSACTIONS ON STORAGE, 2013, 9 (02)
  • [10] Efficient integrity verification of replicated data in cloud computing system
    Yi, Mingxu
    Wei, Jinxia
    Song, Lingwei
    [J]. COMPUTERS & SECURITY, 2017, 65 : 202 - 212