Data intensive and network aware (DIANA) grid scheduling

被引:49
|
作者
McClatchey R. [1 ]
Anjum A. [1 ,3 ]
Stockinger H. [2 ]
Ali A. [3 ]
Willers I. [4 ]
Thomas M. [5 ]
机构
[1] CCS Research Centre, University of the West of England, Bristol
[2] Swiss Institute of Bioinformatics, Lausanne
[3] National University of Sciences and Technology, Rawalpindi
[4] CERN, European Organization for Nuclear Research, Geneva
[5] California Institute of Technology, Pasadena, CA
来源
J. Grid Comput. | 2007年 / 1卷 / 43-64期
关键词
Data intensive; Meta scheduling; Network awareness; Peer-to-peer architectures; Scheduling algorithm;
D O I
10.1007/s10723-006-9059-z
中图分类号
学科分类号
摘要
In Grids scheduling decisions are often made on the basis of jobs being either data or computation intensive: in data intensive situations jobs may be pushed to the data and in computation intensive situations data may be pulled to the jobs. This kind of scheduling, in which there is no consideration of network characteristics, can lead to performance degradation in a Grid environment and may result in large processing queues and job execution delays due to site overloads. In this paper we describe a Data Intensive and Network Aware (DIANA) meta-scheduling approach, which takes into account data, processing power and network characteristics when making scheduling decisions across multiple sites. Through a practical implementation on a Grid testbed, we demonstrate that queue and execution times of data-intensive jobs can be significantly improved when we introduce our proposed DIANA scheduler. The basic scheduling decisions are dictated by a weighting factor for each potential target location which is a calculated function of network characteristics, processing cycles and data location and size. The job scheduler provides a global ranking of the computing resources and then selects an optimal one on the basis of this overall access and execution cost. The DIANA approach considers the Grid as a combination of active network elements and takes network characteristics as a first class criterion in the scheduling decision matrix along with computations and data. The scheduler can then make informed decisions by taking into account the changing state of the network, locality and size of the data and the pool of available processing cycles. © Springer Science + Business Media B.V. 2007.
引用
收藏
页码:43 / 64
页数:21
相关论文
共 50 条
  • [1] A data-aware cognitive engine for scheduling data intensive applications in a grid
    Nagarajan, Vijaya
    Mohamed Mulk Abdul, Maluk
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2017, 25 (01) : 497 - 507
  • [2] Network-aware Grid scheduling
    Caminero, Agustin
    Caminero, Blanca
    Carrion, Carmen
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2007: OTM 2007 WORKSHOPS, PT 1, PROCEEDINGS, 2007, 4805 : 33 - +
  • [3] Network-Aware HEFT Scheduling for Grid
    Yousaf, Muhammad Murtaza
    Welzl, Andmichael
    SCIENTIFIC WORLD JOURNAL, 2014,
  • [4] Network and Data Location Aware Job Scheduling in Grid: Improvement to GridWay Metascheduler
    Kumar, Saumesh
    Kumar, Naveen
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2012, 5 (01): : 87 - 99
  • [5] Network and data location aware job scheduling in grid: Improvement to gridway metascheduler
    Kumar, S. (saumeshkumar@gmail.com), 1600, Science and Engineering Research Support Society, Room 402, Man-Je Bld., 449-8, Ojung-Dong, Daedoek-Gu, Korea, Republic of (05):
  • [6] Data Intensive, Computing and Network Aware (DCN) Cloud VMs Scheduling Algorithm
    Alharbi, Yasser
    Walker, Stuart
    PROCEEDINGS OF 2016 FUTURE TECHNOLOGIES CONFERENCE (FTC), 2016, : 1257 - 1264
  • [7] A scheduling middleware for data intensive applications on a grid
    Lee, Moo-hun
    In, Jang-uk
    Choi, Eui-in
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 3, PROCEEDINGS, 2006, 4253 : 1058 - 1067
  • [8] Dependency-Aware Network Adaptive Scheduling of Data-Intensive Parallel Jobs
    Wang, Shaoqi
    Chen, Wei
    Zhou, Xiaobo
    Zhang, Liqiang
    Wang, Yin
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (03) : 515 - 529
  • [9] Locality and Network-Aware Reduce Task Scheduling for Data-Intensive Applications
    Arslan, Engin
    Shekhar, Mrigank
    Kosar, Tevfik
    2014 5TH INTERNATIONAL WORKSHOP ON DATA-INTENSIVE COMPUTING IN THE CLOUDS (DATACLOUD), 2014, : 17 - 24
  • [10] Smart Grid-aware Scheduling in Data Centres
    Maesker, Markus
    Nagel, Lars
    Brinkmann, Andre
    Lotfifar, Foad
    Johnson, Matthew
    2015 SUSTAINABLE INTERNET AND ICT FOR SUSTAINABILITY (SUSTAINIT), 2015,