Efficient reuse of replicated parallel data segments in computational grids

被引:10
|
作者
Tikar, Sandip [2 ]
Vadhiyar, Sathish [1 ]
机构
[1] Indian Inst Sci, Supercomp Educ & Res Ctr, Bangalore 560012, Karnataka, India
[2] EverGrid, Pune 411045, Maharashtra, India
关键词
replica selection; scheduling; data movement; grids; parallel applications; parallel algorithms;
D O I
10.1016/j.future.2008.01.001
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Grids are being used for executing parallel applications over remote resources. For executing a parallel application on a set of grid resources chosen by a user or a grid scheduler, the input data needed by the application is segmented according to the data distribution followed in the application and the data segments are distributed to the grid resources. The same input data may be used subsequently by different applications leading to multiple copies (replicas) of parallel data segments in various grid resources. The data needed for a parallel application can be gathered from the existing replicas onto the computational resources chosen by the grid scheduler for application execution. In this work, we have devised novel algorithms for determining "nearest" replica sites containing data segments needed by a parallel application executing on a set of resources with the objective of minimizing the time needed for transferring the data segments from the replica sites to the resources. We have tested our algorithms on different kinds of experimental setups. We find that the best algorithm varies according to the configuration of data servers and clients. In all cases, our algorithms performed better than the existing algorithms by at least 15%. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:644 / 657
页数:14
相关论文
共 50 条
  • [31] Resource co-allocation for parallel tasks in computational grids
    Wang, LZ
    Cai, WT
    Lee, BS
    [J]. PROCEEDINGS OF THE INTERNATIONAL WORKSHOP ON CHALLENGES OF LARGE APPLICATIONS IN DISTRIBUTED ENVIRONMENTS, 2003, : 88 - 95
  • [32] Parallel Memetic Algorithms for Independent Job Scheduling in Computational Grids
    Xhafa, Fatos
    Duran, Bernat
    [J]. RECENT ADVANCES IN EVOLUTIONARY COMPUTATION FOR COMBINATORIAL OPTIMIZATION, 2008, 153 : 219 - 239
  • [33] Mining internet data sets for computational grids
    Borzemski, L
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 3, PROCEEDINGS, 2005, 3683 : 268 - 274
  • [34] Overview of security considerations for computational and data Grids
    Johnston, WE
    Jackson, KR
    Talwar, S
    [J]. 10TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, PROCEEDINGS, 2001, : 439 - 440
  • [35] Interoperable PKI Data Distribution in Computational Grids
    Pala, Massimiliano
    Cholia, Shreyas
    Rea, Scott A.
    Smith, Sean W.
    [J]. INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2009, 1 (02) : 56 - 73
  • [36] Computational grids in the clinic: Search for an IMRT efficient frontier
    Dimitroyannis, D. A.
    [J]. INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2007, 69 (03): : S630 - S631
  • [37] Efficient algorithms for the reliabilities of replicated n-parallel-task graphs
    Hwang, FK
    Wright, PE
    [J]. OPERATIONS RESEARCH LETTERS, 1996, 19 (04) : 171 - 174
  • [38] An efficient replication scheme for data grids
    Nam, DS
    Youn, CH
    Jeong, SJ
    Shim, EB
    Lee, EY
    Park, EK
    [J]. 2004 12TH IEEE INTERNATIONAL CONFERENCE ON NETWORKS, VOLS 1 AND 2 , PROCEEDINGS: UNITY IN DIVERSITY, 2004, : 392 - 396
  • [39] Exploiting replication and data reuse to efficiently schedule data-intensive applications on grids
    Santos-Neto, E
    Cirne, W
    Brasileiro, F
    Lima, A
    [J]. JOB SCHEDULING STRATEGIES FOR PARALLEL PROCESSING, 2005, 3277 : 210 - 232
  • [40] Scalability analysis of parallel Particle-In-Cell codes on computational grids
    Tao, WeiFeng
    Cai, DongSheng
    Yan, XiaoYang
    Ken-ichi, Nishikawa
    Lembege, Bertrand
    [J]. COMPUTER PHYSICS COMMUNICATIONS, 2008, 179 (12) : 855 - 864