Data intensive and network aware (DIANA) grid scheduling

被引:49
|
作者
McClatchey R. [1 ]
Anjum A. [1 ,3 ]
Stockinger H. [2 ]
Ali A. [3 ]
Willers I. [4 ]
Thomas M. [5 ]
机构
[1] CCS Research Centre, University of the West of England, Bristol
[2] Swiss Institute of Bioinformatics, Lausanne
[3] National University of Sciences and Technology, Rawalpindi
[4] CERN, European Organization for Nuclear Research, Geneva
[5] California Institute of Technology, Pasadena, CA
来源
J. Grid Comput. | 2007年 / 1卷 / 43-64期
关键词
Data intensive; Meta scheduling; Network awareness; Peer-to-peer architectures; Scheduling algorithm;
D O I
10.1007/s10723-006-9059-z
中图分类号
学科分类号
摘要
In Grids scheduling decisions are often made on the basis of jobs being either data or computation intensive: in data intensive situations jobs may be pushed to the data and in computation intensive situations data may be pulled to the jobs. This kind of scheduling, in which there is no consideration of network characteristics, can lead to performance degradation in a Grid environment and may result in large processing queues and job execution delays due to site overloads. In this paper we describe a Data Intensive and Network Aware (DIANA) meta-scheduling approach, which takes into account data, processing power and network characteristics when making scheduling decisions across multiple sites. Through a practical implementation on a Grid testbed, we demonstrate that queue and execution times of data-intensive jobs can be significantly improved when we introduce our proposed DIANA scheduler. The basic scheduling decisions are dictated by a weighting factor for each potential target location which is a calculated function of network characteristics, processing cycles and data location and size. The job scheduler provides a global ranking of the computing resources and then selects an optimal one on the basis of this overall access and execution cost. The DIANA approach considers the Grid as a combination of active network elements and takes network characteristics as a first class criterion in the scheduling decision matrix along with computations and data. The scheduler can then make informed decisions by taking into account the changing state of the network, locality and size of the data and the pool of available processing cycles. © Springer Science + Business Media B.V. 2007.
引用
收藏
页码:43 / 64
页数:21
相关论文
共 50 条
  • [41] An enhanced data-aware scheduling algorithm for batch-mode dataintensive jobs on data grid
    Jiang, Jianhua
    Xu, Gaochao
    Wei, Xiaohui
    2006 INTERNATIONAL CONFERENCE ON HYBRID INFORMATION TECHNOLOGY, VOL 1, PROCEEDINGS, 2006, : 257 - +
  • [42] Structure aware resource estimation for effective scheduling and execution of data intensive workflows in cloud
    Kanagaraj, K.
    Swamynathan, S.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 79 : 878 - 891
  • [43] TRACON: Interference-Aware Scheduling for Data-Intensive Applications in Virtualized Environments
    Chiang, Ron C.
    Huang, H. Howie
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2014, 25 (05) : 1349 - 1358
  • [44] An Energy-Aware Heuristic Scheduling for Data-Intensive Workflows in Virtualized Datacenters
    Xiao, Peng
    Hu, Zhi-Gang
    Zhang, Yan-Ping
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2013, 28 (06) : 948 - 961
  • [45] A New Data-Intensive Task Scheduling in OptorSim, an Open Source Grid Simulator
    Moghadam, Mahshid Helali
    Babamir, Seyyed Morteza
    2016 2ND INTERNATIONAL CONFERENCE ON OPEN SOURCE SOFTWARE COMPUTING (OSSCOM), 2016,
  • [46] Heuristic-based scheduling to maximize throughput of data-intensive grid applications
    Ray, S
    Zhang, Z
    DISTRIBUTED COMPUTING - IWDC 2004, PROCEEDINGS, 2004, 3326 : 63 - 74
  • [47] New worker-centric scheduling strategies for data-intensive grid applications
    Ko, Steven Y.
    Morales, Ramses
    Gupta, Indranil
    MIDDLEWARE 2007, PROCEEDINGS, 2007, 4834 : 121 - 142
  • [48] Network Bandwidth-aware job scheduling with dynamic information model for Grid resource brokers
    Chao-Tung Yang
    Fang-Yie Leu
    Sung-Yi Chen
    The Journal of Supercomputing, 2010, 52 : 199 - 223
  • [49] Network Bandwidth-aware job scheduling with dynamic information model for Grid resource brokers
    Yang, Chao-Tung
    Leu, Fang-Yie
    Chen, Sung-Yi
    JOURNAL OF SUPERCOMPUTING, 2010, 52 (03): : 199 - 223
  • [50] A Network Bandwidth-Aware Job Scheduling with Dynamic Information Model for Grid Resource Brokers
    Yang, Chao-Tung
    Leu, Fang-Yie
    Hu, Wen-Jen
    2008 IEEE ASIA-PACIFIC SERVICES COMPUTING CONFERENCE, VOLS 1-3, PROCEEDINGS, 2008, : 775 - 780