Resource-aware distributed scheduling strategies for large-scale computational Cluster/Grid systems

被引:43
|
作者
Viswanathan, Sivakumar [1 ]
Veeravalli, Bharadwaj
Robertazzi, Thomas G.
机构
[1] Natl Univ Singapore, Dept Elect & Comp Engn, CNDS Lab, Singapore, Singapore
[2] SUNY Stony Brook, Dept Elect & Comp Engn, Stony Brook, NY 11794 USA
关键词
divisible loads; grid computing; cluster computing; buffer constraints; processing time; deadlines;
D O I
10.1109/TPDS.2007.1073
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we propose distributed algorithms referred to as Resource-Aware Dynamic Incremental Scheduling ( RADIS) strategies. Our strategies are specifically designed to handle large volumes of computationally intensive arbitrarily divisible loads submitted for processing at cluster/grid systems involving multiple sources and sinks ( processing nodes). We consider a real-life scenario, wherein the buffer space ( memory) available at the sinks ( required for holding and processing the loads) varies over time, and the loads have deadlines and propose efficient "pull-based" scheduling strategies with an admission control policy that ensures that the admitted loads are processed, satisfying their deadline requirements. The design of our proposed strategies adopts the divisible load paradigm, referred to as the divisible load theory ( DLT), which is shown to be efficient in handling large volume loads. We demonstrate detailed workings of the proposed algorithms via a simulation study by using real-life parameters obtained from a major physics experiment.
引用
收藏
页码:1450 / 1461
页数:12
相关论文
共 50 条
  • [1] Resource-aware allocation strategies for divisible loads on large-scale systems
    Benoit, Anne
    Marchal, Loris
    Pineau, Jean-Francois
    Robert, Yves
    Vivien, Frederic
    [J]. 2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 1433 - +
  • [2] REMO: Resource-Aware Application State Monitoring for Large-Scale Distributed Systems
    Meng, Shicong
    Kashyap, Srinivas R.
    Venkatramani, Chitra
    Liu, Ling
    [J]. 2009 29TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, 2009, : 248 - +
  • [3] Autonomous Resource-Aware Scheduling of Large-Scale Media Workflows
    Desmet, Stein
    Volckaert, Bruno
    De Turck, Filip
    [J]. MECHANISMS FOR AUTONOMOUS MANAGEMENT OF NETWORKS AND SERVICES, 2010, 6155 : 50 - 64
  • [4] A Resource-Aware Task Scheduling Algorithm on Mobile Computational Grid
    Chang, Yue-Shan
    Chang, Hung-Hsiang
    Sheu, Ruey-Kai
    Tsai, Ching-Tsorng
    [J]. JOURNAL OF INTERNET TECHNOLOGY, 2011, 12 (02): : 279 - 291
  • [5] DRACO: Distributed Resource-aware Admission Control for large-scale, multi-tier systems
    Cotroneo, Domenico
    Natella, Roberto
    Rosiello, Stefano
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2024, 192
  • [6] The failure-rate aware scheduling policies for large-scale cluster systems
    Wu Linping
    Chao, Ren
    Dan, Meng
    Zhan, Jianfeng
    Bibo, Tu
    [J]. SEVENTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2006, : 364 - +
  • [7] A Review of Resource Scheduling in Large-Scale Server Cluster
    He, Libo
    Qiang, Zhenping
    Zhou, Wei
    Yao, Shaowen
    [J]. KNOWLEDGE MANAGEMENT IN ORGANIZATIONS (KMO 2017), 2017, 731 : 494 - 505
  • [8] Application-aware deadline constraint job scheduling mechanism on large-scale computational grid
    Tang, Xiaoyong
    Liao, Xiaoyi
    [J]. PLOS ONE, 2018, 13 (11):
  • [9] Resource-aware hybrid scheduling algorithm in heterogeneous distributed computing
    Vasile, Mihaela-Andreea
    Pop, Florin
    Tutueanu, Radu-Ioan
    Cristea, Valentin
    Kolodziej, Joanna
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2015, 51 : 61 - 71
  • [10] Low Latency and Resource-aware Program Composition for Large-scale Data Analysis
    Tanaka, Masahiro
    Taura, Kenjiro
    Torisawa, Kentaro
    [J]. 2016 16TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2016, : 325 - 330