PFRF: An adaptive data replication algorithm based on star-topology data grids

被引:29
|
作者
Lee, Ming-Chang [1 ]
Leu, Fang-Yie [2 ]
Chen, Ying-ping [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Comp Sci, Taipei, Taiwan
[2] Tunghai Univ, Dept Comp Sci, Taichung, Taiwan
关键词
Data grid; Data replication; Data access patterns; File popularity; PFRF; STRATEGY; DISTRIBUTIONS; SERVICE;
D O I
10.1016/j.future.2011.08.015
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recently, data replication algorithms have been widely employed in data grids to replicate frequently accessed data to appropriate sites. The purposes are shortening file transmission distance and delivering files from nearby sites to local sites so as to improve data access performance and reduce bandwidth consumption. Some of the algorithms were designed based on unlimited storage. However, they might not be practical in real-world data grids since currently no system has infinite storage. Others were implemented on limited storage environments, but none of them considers data access patterns which reflect the changes of users' interests, and these are important parameters affecting file retrieval efficiency and bandwidth consumption. In this paper, we propose an adaptive data replication algorithm, called the Popular File Replicate First algorithm (PFRF for short), which is developed on a star-topology data grid with limited storage space based on aggregated information on previous file accesses. The PFRF periodically calculates file access popularity to track the variation of users' access behaviors, and then replicates popular files to appropriate sites to adapt to the variation. We employ several types of file access behaviors, including Zipf-like, geometric, and uniform distributions, to evaluate PFRF. The simulation results show that PFRF can effectively improve average job turnaround time, bandwidth consumption for data delivery, and data availability as compared with those of the tested algorithms. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:1045 / 1057
页数:13
相关论文
共 50 条
  • [1] A New Fuzzy Based Dynamic Data Replication Algorithm in Data Grids
    Beigrezaei, Mahsa
    Kanan, Hamidreza Rashidy
    Haghighat, Abolfazl Toroghi
    [J]. 2013 13TH IRANIAN CONFERENCE ON FUZZY SYSTEMS (IFSC), 2013,
  • [2] A File Group Data Replication Algorithm for Data Grids
    Amir Masoud Rahmani
    Leila Azari
    Helder A. Daniel
    [J]. Journal of Grid Computing, 2017, 15 : 379 - 393
  • [3] A File Group Data Replication Algorithm for Data Grids
    Rahmani, Amir Masoud
    Azari, Leila
    Daniel, Helder A.
    [J]. JOURNAL OF GRID COMPUTING, 2017, 15 (03) : 379 - 393
  • [4] A data replication algorithm for groups of files in data grids
    Azari, Leila
    Rahmani, Amir Masoud
    Daniel, Helder A.
    Qader, Nooruldeen Nasih
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 113 : 115 - 126
  • [5] An adaptive data replication algorithm
    Wolfson, O
    Jajodia, S
    Huang, YX
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 1997, 22 (02): : 255 - 314
  • [6] Combination of data replication and scheduling algorithm for improving data availability in Data Grids
    Mansouri, Najme
    Dastghaibyfard, Gholam Hosein
    Mansouri, Ehsan
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2013, 36 (02) : 711 - 722
  • [7] TSR: Topology Reduction from Tree to Star Data Grids
    Lee, Ming-Chang
    Leu, Fang-Yie
    Chen, Ying-ping
    [J]. 2013 SEVENTH INTERNATIONAL CONFERENCE ON INNOVATIVE MOBILE AND INTERNET SERVICES IN UBIQUITOUS COMPUTING (IMIS 2013), 2013, : 678 - 683
  • [8] PDDRA: A new pre-fetching based dynamic data replication algorithm in data grids
    Saadat, Nazanin
    Rahmani, Amir Masoud
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2012, 28 (04): : 666 - 681
  • [9] A strategy for data replication in data grids
    Xu, LT
    Wang, B
    Ai, B
    [J]. CURRENT TRENDS IN HIGH PERFORMANCE COMPUTING AND ITS APPLICATIONS, PROCEEDINGS, 2005, : 557 - 562
  • [10] Data Replication and the Storage Capacity of Data Grids
    Figueira, Silvia
    Trieu, Tan
    [J]. HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2008, 2008, 5336 : 567 - +