Generalized Optimal Response Time Retrieval of Replicated Data from Storage Arrays

Cited by: 4
Authors
Altiparmak, Nihat [1]
Tosun, Ali Saman [1]
Affiliation
[1] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX USA
Funding
National Science Foundation (USA)
Keywords
Design; Algorithms; Performance; Declustering; replication; storage arrays; generalized retrieval; maximum flow; linear programming; ALLOCATION;
DOI
10.1145/2491472.2491474
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Declustering techniques reduce query response times through parallel I/O by distributing data among parallel disks. Recently, replication-based approaches were proposed to further reduce the response time. Efficient retrieval of replicated data from multiple disks is a challenging problem. Existing retrieval techniques are designed for storage arrays with identical disks that have no initial load or network delay. In this article, we consider the generalized retrieval problem of replicated data, where the disks in the system might be heterogeneous, the disks may have initial load, and the storage arrays might be located at different sites. We first formulate the generalized retrieval problem using a Linear Programming (LP) model and solve it with mixed integer programming techniques. Next, the generalized retrieval problem is formulated as a more efficient maximum flow problem. We prove that the retrieval schedule returned by the maximum flow technique yields the optimal response time and that this result matches the LP solution. We also propose a low-complexity online algorithm for the generalized retrieval problem that does not guarantee optimality of the result. The performance of the proposed and state-of-the-art retrieval strategies is investigated using various replication schemes, query types, query loads, disk specifications, network delays, and initial loads.
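The maximum flow view of the generalized retrieval problem can be illustrated with a small sketch. This is not the authors' algorithm; it is a minimal illustration under stated assumptions. It assumes a disk `d` that retrieves `k` buckets finishes at `delay[d] + init_load[d] + k * cost[d]`, and it searches for the smallest response time `T` for which every bucket can be assigned to one of its replica disks. The feasibility check is a unit-capacity flow from buckets to disks, solved here with simple augmenting paths. All names (`optimal_retrieval`, `assign`, `cost`, `init_load`, `delay`) are hypothetical.

```python
from typing import List, Optional, Tuple


def assign(replicas: List[List[int]], caps: List[int]) -> Optional[List[int]]:
    """Assign each bucket to one replica disk without exceeding disk capacities.

    Equivalent to a max flow check on source -> buckets -> disks -> sink,
    solved with augmenting paths. Returns disk index per bucket, or None.
    """
    load: List[List[int]] = [[] for _ in caps]  # buckets currently on each disk

    def place(b: int, seen: set) -> bool:
        for d in replicas[b]:
            if d in seen:
                continue
            seen.add(d)
            if len(load[d]) < caps[d]:
                load[d].append(b)
                return True
            # Disk is full: try to move one of its buckets elsewhere.
            for i, other in enumerate(load[d]):
                if place(other, seen):
                    load[d][i] = b
                    return True
        return False

    for b in range(len(replicas)):
        if not place(b, set()):
            return None
    schedule = [0] * len(replicas)
    for d, buckets in enumerate(load):
        for b in buckets:
            schedule[b] = d
    return schedule


def optimal_retrieval(replicas, cost, init_load, delay) -> Tuple[float, List[int]]:
    """Smallest max finish time and a schedule achieving it (assumed cost model)."""
    n, disks = len(replicas), range(len(cost))

    def finish(d: int, k: int):
        return delay[d] + init_load[d] + k * cost[d]

    # The optimal response time must equal some disk's finish time for some k.
    candidates = sorted({finish(d, k) for d in disks for k in range(1, n + 1)})

    def caps_for(t):
        # Number of buckets each disk can finish by time t.
        return [sum(1 for k in range(1, n + 1) if finish(d, k) <= t) for d in disks]

    lo, hi = 0, len(candidates) - 1  # feasibility is monotone in t
    while lo < hi:
        mid = (lo + hi) // 2
        if assign(replicas, caps_for(candidates[mid])) is not None:
            hi = mid
        else:
            lo = mid + 1
    best = candidates[lo]
    return best, assign(replicas, caps_for(best))
```

For example, with `replicas = [[0, 1], [0, 1], [0], [1]]`, `cost=[10, 5]`, `init_load=[0, 0]`, and `delay=[0, 2]`, the sketch sends one bucket to the slower disk 0 and three to disk 1, for a response time of 17 under the assumed cost model.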
Pages: 36