A Cluster-Based Implementation of a Fault Tolerant Parallel Reduction Algorithm Using Swarm-Array Computing

Cited by: 2
Authors
Varghese, Blesson [1]
McKee, Gerard [1]
Alexandrov, Vassil [1]
Affiliations
[1] Univ Reading, Sch Syst Engn, Reading RG6 6AY, Berks, England
Keywords
swarm-array computing; intelligent agents; fault-tolerant system; cluster-based implementation
DOI
10.1109/ICAS.2010.13
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Recent research in multi-agent systems incorporates fault tolerance concepts. However, that research does not explore the extension and implementation of such ideas for large-scale parallel computing systems. The work reported in this paper investigates a swarm-array computing approach, namely 'Intelligent Agents'. In this approach, a task to be executed on a parallel computing system is decomposed into sub-tasks and mapped onto agents that traverse an abstracted hardware layer. The agents intercommunicate across processors to share information when a core or processor failure is predicted, so that the task can still complete successfully. The agents hence contribute towards fault tolerance and towards building reliable systems. The feasibility of the approach is validated by simulations on an FPGA using a multi-agent simulator and by an implementation of a parallel reduction algorithm on a computer cluster using the Message Passing Interface.
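The idea in the abstract can be illustrated with a small single-process simulation. This is only a sketch under stated assumptions and not the authors' implementation: it does not use the Message Passing Interface, and the names `Agent`, `migrate_if_needed`, `fault_tolerant_reduce`, and `failure_schedule` are hypothetical. Each agent carries the partial sum of one sub-task; before every round of a pairwise (tree) reduction, an agent sitting on a core predicted to fail moves to the next healthy core and takes its partial result with it.

```python
from dataclasses import dataclass


@dataclass
class Agent:
    """Hypothetical agent: carries a partial result and knows its host core."""
    agent_id: int
    core: int
    value: float


def migrate_if_needed(agent, failed_cores, n_cores):
    """Move the agent to the next core not predicted to fail.

    Returns True if the agent moved. Several agents may share a core
    in this simplified model.
    """
    if agent.core not in failed_cores:
        return False
    for step in range(1, n_cores):
        candidate = (agent.core + step) % n_cores
        if candidate not in failed_cores:
            agent.core = candidate
            return True
    raise RuntimeError("no healthy core available")


def fault_tolerant_reduce(data, n_cores, failure_schedule):
    """Sum `data` with one agent per core using a pairwise reduction.

    `failure_schedule` (an assumption of this sketch) maps a round number
    to the set of cores predicted to fail before that round.
    Returns (total, number_of_migrations).
    """
    chunk = -(-len(data) // n_cores)  # ceiling division
    agents = [
        Agent(i, i, sum(data[i * chunk:(i + 1) * chunk]))
        for i in range(n_cores)
    ]
    failed, migrations, round_no = set(), 0, 0
    while len(agents) > 1:
        failed |= failure_schedule.get(round_no, set())
        # Agents leave cores that are predicted to fail, keeping their state.
        for agent in agents:
            migrations += migrate_if_needed(agent, failed, n_cores)
        # Pairwise combine: the left agent absorbs its right neighbour.
        survivors = []
        for i in range(0, len(agents), 2):
            left = agents[i]
            if i + 1 < len(agents):
                left.value += agents[i + 1].value
            survivors.append(left)
        agents = survivors
        round_no += 1
    return agents[0].value, migrations


if __name__ == "__main__":
    total, moves = fault_tolerant_reduce(
        list(range(1, 17)), 4, {0: {1}, 1: {0}}
    )
    print(total, moves)
```

In this toy run, core 1 is predicted to fail before the first round and core 0 before the second; the sum is still completed because the affected agents relocate before combining. A cluster version would replace the in-memory hand-off with message passing between processes.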
Pages: 30-36
Number of pages: 7