Efficient reuse of replicated parallel data segments in computational grids

Cited by: 10
Authors
Tikar, Sandip [2 ]
Vadhiyar, Sathish [1 ]
Affiliations
[1] Indian Inst Sci, Supercomp Educ & Res Ctr, Bangalore 560012, Karnataka, India
[2] EverGrid, Pune 411045, Maharashtra, India
Keywords
replica selection; scheduling; data movement; grids; parallel applications; parallel algorithms
DOI
10.1016/j.future.2008.01.001
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Grids are being used for executing parallel applications over remote resources. For executing a parallel application on a set of grid resources chosen by a user or a grid scheduler, the input data needed by the application is segmented according to the data distribution followed in the application and the data segments are distributed to the grid resources. The same input data may be used subsequently by different applications leading to multiple copies (replicas) of parallel data segments in various grid resources. The data needed for a parallel application can be gathered from the existing replicas onto the computational resources chosen by the grid scheduler for application execution. In this work, we have devised novel algorithms for determining "nearest" replica sites containing data segments needed by a parallel application executing on a set of resources with the objective of minimizing the time needed for transferring the data segments from the replica sites to the resources. We have tested our algorithms on different kinds of experimental setups. We find that the best algorithm varies according to the configuration of data servers and clients. In all cases, our algorithms performed better than the existing algorithms by at least 15%. (C) 2008 Elsevier B.V. All rights reserved.
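The replica selection problem described in the abstract can be illustrated with a small sketch. This is not the authors' algorithm, which the abstract does not specify; it is a simple load-aware greedy heuristic written under stated assumptions: every segment has the same size, each replica site serves its transfers one after another, the client side is never the bottleneck, and the point-to-point bandwidths are known. The function name `select_replicas` and all parameter names are hypothetical.

```python
from collections import defaultdict


def select_replicas(needs, replicas, bandwidth, seg_size_mb):
    """Greedy, load-aware choice of a replica site for each needed segment.

    Illustrative only (not the method from the paper).
    needs:       {client: [segment, ...]}   segments each compute resource requires
    replicas:    {segment: [site, ...]}     sites holding a copy of each segment
    bandwidth:   {(site, client): MB/s}     assumed known and constant
    seg_size_mb: size of every segment in MB (assumed uniform)
    Returns (plan, makespan): plan maps (client, segment) -> chosen site,
    makespan is the time at which the last site finishes sending.
    """
    # Time at which each site finishes the transfers already assigned to it.
    site_busy_until = defaultdict(float)
    plan = {}
    for client in sorted(needs):
        for seg in needs[client]:
            best_site, best_finish = None, float("inf")
            for site in replicas.get(seg, []):
                # Assumption: a site sends its queued segments sequentially.
                finish = site_busy_until[site] + seg_size_mb / bandwidth[(site, client)]
                if finish < best_finish:
                    best_site, best_finish = site, finish
            if best_site is None:
                raise ValueError(f"no replica available for segment {seg!r}")
            site_busy_until[best_site] = best_finish
            plan[(client, seg)] = best_site
    makespan = max(site_busy_until.values(), default=0.0)
    return plan, makespan
```

Because the heuristic accounts for transfers already queued at a site, a slower but idle site can be preferred over a faster but busy one, which is one reason the "nearest" replica in the abstract's sense need not be the site with the highest raw bandwidth.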
Pages: 644-657
Number of pages: 14