A dynamic data grid replication strategy to minimize the data missed

Cited by: 0
Authors
Lei, Ming [1]
Vrbsky, Susan V. [1]
Hong, Xiaoyan [1]
Affiliations
[1] Univ Alabama, Dept Comp Sci, Tuscaloosa, AL 35487 USA
Keywords
data availability; Data Grid; data missing rate; limited storage; replica strategy
DOI
Not available
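Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809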
Abstract
Data availability in a Data Grid system is complicated by node failures, data catalog errors, and an unreliable network. To improve job response time and data availability, data is typically replicated in large-scale, data-massive applications. However, the dynamic behavior of Grid users makes it difficult to determine where and how to replicate data to meet the system availability goal. Some data replication strategies have been proposed previously, but they assume unlimited storage for replicas. In this paper, we present two new metrics to measure system data availability. We then model the system availability problem under limited replica storage and transform it into a classic optimization problem. We present four strategies for limited replica storage that maximize data availability by minimizing the data missing rate (MinDmr), based on a file weight and a prediction function. Our simulations in OptorSim show that our MinDmr algorithm achieves better overall performance than the others in terms of data availability. The results indicate that, as far as the data missing rate is concerned, MinDmr always performs better than the others across varying prediction functions, job schedulers, and file access patterns.
Pages: 721 / +
Page count: 2