Job scheduling and dynamic data replication in data grid environment

Cited by: 0
Authors
Najme Mansouri
Gholam Hosein Dastghaibyfard
Affiliation
[1] Shiraz University, Department of Computer Science & Engineering, College of Electrical & Computer Engineering
Source
Journal of Supercomputing, 2013, 64 (01)
Keywords
Data grid; Data replication; Job scheduling; Simulation
DOI
Not available
Abstract
A Data Grid is a geographically distributed environment for large-scale, data-intensive applications. Effective scheduling in a Grid can reduce the amount of data transferred among nodes by submitting each job to a node where most of the requested data files are already available. Data replication is another key optimization technique: it reduces access latency and helps manage large volumes of data by placing copies wisely. This paper proposes two algorithms. The first is a novel job scheduling algorithm, the Combined Scheduling Strategy (CSS), which considers the number of jobs waiting in the queue, the location of the data the job requires, and computational capability. The second is a dynamic data replication strategy, the Dynamic Hierarchical Replication Algorithm (DHRA), which improves file access time. DHRA stores each replica at an appropriate site, namely the site in the requested region that has the highest number of accesses for that particular replica. It also minimizes access latency by selecting the best replica when several sites hold replicas of a dataset. Simulation results show that the proposed replication and scheduling strategies perform better than the algorithms they were compared against.
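The two selection rules described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' CSS or DHRA implementation: the abstract does not say how the three scheduling factors are combined, so the sketch assumes a simple ordering (most locally available files first, then the shortest queue relative to capacity). The `Site` class, its fields, and both function names are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Site:
    # Hypothetical model of a grid site; all field names are assumptions.
    name: str
    region: str
    queue_length: int        # jobs waiting in the site's queue
    compute_capacity: float  # relative processing speed (must be > 0)
    files: set = field(default_factory=set)            # files stored locally
    access_counts: dict = field(default_factory=dict)  # file name -> accesses


def pick_site_for_job(sites, required_files):
    """Assumed scheduling rule: prefer the site holding the most required
    files; break ties by the smallest queue length per unit of capacity."""
    def key(site):
        local_files = len(required_files & site.files)
        expected_wait = site.queue_length / site.compute_capacity
        return (-local_files, expected_wait)
    return min(sites, key=key)


def pick_replica_site(sites, region, filename):
    """Placement rule from the abstract: within the given region, choose
    the site with the highest access count for the file."""
    candidates = [s for s in sites if s.region == region]
    return max(candidates, key=lambda s: s.access_counts.get(filename, 0))
```

A weighted sum of the three factors would be an equally plausible reading of the abstract; the ordering used here is chosen only because it is easy to follow.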
Pages: 204-225
Number of pages: 21
Related papers
50 in total
  • [1] Job scheduling and dynamic data replication in data grid environment
    Mansouri, Najme
    Dastghaibyfard, Gholam Hosein
    [J]. JOURNAL OF SUPERCOMPUTING, 2013, 64 (01): 204-225
  • [2] Job Scheduling for Dynamic Data Replication Strategy Based on Federation Data Grid Systems
    Zarina, M.
    Deris, M. Mat
    Rose, ANM. M.
    Isa, A. M.
    [J]. ADVANCES IN WIRELESS, MOBILE NETWORKS AND APPLICATIONS, 2011, 154: 283-+
  • [3] The impact of data replication on job scheduling performance in the Data Grid
    Tang, M
    Lee, BS
    Tang, XY
    Yeo, CK
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2006, 22 (03): 254-268
  • [4] Efficient and dynamic Parallel Job Scheduling for bioinformatics Data Management in Data Grid Environment
    Kumar, K. Ashok
    Chandrasekar, C.
    [J]. RESEARCH JOURNAL OF PHARMACEUTICAL BIOLOGICAL AND CHEMICAL SCIENCES, 2015, 6 (03): 1492-1501
  • [5] Combining data replication algorithms and job scheduling heuristics in the Data Grid
    Tang, M
    Lee, BS
    Tang, XY
    Yeo, CK
    [J]. EURO-PAR 2005 PARALLEL PROCESSING, PROCEEDINGS, 2005, 3648: 381-390
  • [6] A Hierarchical Approach to Improve Job Scheduling and Data Replication in Data Grid
    Abdi, Somayeh
    Hashemi, Sayyed
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2015, 12 (03): 278-285
  • [7] A Threshold-based Dynamic Data Replication and Parallel Job Scheduling strategy to enhance Data Grid
    Mansouri, N.
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2014, 17 (03): 957-977
  • [8] A Threshold-based Dynamic Data Replication and Parallel Job Scheduling strategy to enhance Data Grid
    N. Mansouri
    [J]. Cluster Computing, 2014, 17: 957-977
  • [9] Improvement of Data Grid's performance by combining job scheduling with dynamic replication strategy
    Dang, Nhan Nguyen
    Hwang, Soonwook
    Lim, Sang Boem
    [J]. SIXTH INTERNATIONAL CONFERENCE ON GRID AND COOPERATIVE COMPUTING, PROCEEDINGS, 2007: 513-+
  • [10] The Impact of the Implementation Cost of Replication in Data Grid Job Scheduling
    Nazir, Babar
    Ishaq, Faiza
    Shamshirband, Shahaboddin
    Chronopoulos, Anthony T.
    [J]. MATHEMATICAL AND COMPUTATIONAL APPLICATIONS, 2018, 23 (02)