A Task Scheduling Algorithm Based on Replication for Maximizing Reliability on Heterogeneous Computing Systems

被引:3
|
作者
Wang, Shuli [1 ]
Li, Kenli [1 ]
Mei, Jing [1 ]
Li, Keqin [1 ,2 ]
Wang, Yan [1 ]
机构
[1] Hunan Univ, Coll Informat Sci & Engn, Changsha 410082, Hunan, Peoples R China
[2] SUNY Coll New Paltz, Dept Comp Sci, New Paltz, NY 12561 USA
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Directed acyclic graph; Heterogeneous computing systems; Reliability-aware scheduling; Replication-based algorithm; ALLOCATION; TIME;
D O I
10.1109/IPDPSW.2014.175
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Over the past several years, a heterogeneous computing (HC) system has become more competative as a commercial computing platform than a homogeneous system. With the growing scale of HC systems, network failures become inevitable. To achieve high performance, communication reliability should be considered while designing reliability-aware task scheduling algorithms. In this paper, we propose a new algorithm called RMSR (Replication-based scheduling for Maximizing System Reliability), which incorporates task communication into system reliability. To maximize communication reliability, an improved algorithm which searches all optimal reliability communication paths for current tasks is proposed. During the task replication phase, the task reliability threshold is determined by users and each task has dynamic replicas. Our comparative studies based on randomly generated graphs show that our RMSR algorithm outperforms existing scheduling algorithms in terms of system reliability. Several factors affecting the performance are analyzed in the paper.
引用
收藏
页码:1562 / 1571
页数:10
相关论文
共 50 条
  • [1] A Reliability-aware Task Scheduling Algorithm Based on Replication on Heterogeneous Computing Systems
    Shuli Wang
    Kenli Li
    Jing Mei
    Guoqing Xiao
    Keqin Li
    [J]. Journal of Grid Computing, 2017, 15 : 23 - 39
  • [2] A Reliability-aware Task Scheduling Algorithm Based on Replication on Heterogeneous Computing Systems
    Wang, Shuli
    Li, Kenli
    Mei, Jing
    Xiao, Guoqing
    Li, Keqin
    [J]. JOURNAL OF GRID COMPUTING, 2017, 15 (01) : 23 - 39
  • [3] A Task Scheduling Algorithm for Heterogeneous Distributed Computing Systems
    Badral, Undrakh
    Kim, Jin Suk
    [J]. INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2008, 11 (05): : 553 - 560
  • [4] Task allocation algorithms for maximizing reliability of heterogeneous distributed computing systems
    Mahmood, A
    [J]. CONTROL AND CYBERNETICS, 2001, 30 (01): : 115 - 130
  • [5] A Task-type-based Algorithm for the Energy-aware Profit Maximizing Scheduling Problem in Heterogeneous Computing Systems
    Li, Weidong
    Liu, Xi
    Zhang, Xuejie
    Cai, Xiaobo
    [J]. 2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 1107 - 1110
  • [6] HETS: Heterogeneous Edge and Task Scheduling Algorithm for Heterogeneous Computing Systems
    Masood, Anum
    Munir, Ehsan Ullah
    Rafique, M. Mustafa
    Khan, Samee U.
    [J]. 2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 1865 - 1870
  • [7] A novel task scheduling algorithm for distributed heterogeneous computing systems
    Lai, Guan-Joe
    [J]. APPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING, 2006, 3732 : 1115 - 1122
  • [8] Starvation Avoidance Task Scheduling Algorithm for Heterogeneous Computing Systems
    Gawanmeh, Amjad
    Mansoor, Wathiq
    Abed, Sa'ed
    Kablaoui, Darin
    Al Faisal, Hala
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 1794 - 1799
  • [9] A Reliability Task Scheduling Algorithm with Optimizing Makespan in Heterogeneous Systems
    Jing Wei-Peng
    Wu Zhi-Bo
    Liu Hong-Wei
    Dong Jian
    [J]. 2012 WORLD AUTOMATION CONGRESS (WAC), 2012,
  • [10] Task scheduling for heterogeneous computing systems
    AlEbrahim, Shaikhah
    Ahmad, Imtiaz
    [J]. JOURNAL OF SUPERCOMPUTING, 2017, 73 (06): : 2313 - 2338