Peer-to-peer Cooperative Scheduling Architecture for National Grid Infrastructure

被引:2
|
作者
Matyska, Ludek [1 ]
Ruda, Miroslav [1 ]
Toth, Simon [1 ]
机构
[1] CESNET, Zikova 4, Czech Republic
关键词
D O I
10.1007/978-1-4419-8014-4_8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For some ten years, the Czech National Grid Infrastructure MetaCentrum uses a single central PBSPro installation to schedule jobs across the country. This centralized approach keeps a full track about all the clusters, providing support for jobs spanning several sites, implementation for the fair-share policy and better overall control of the grid environment. Despite a steady progress in the increased stability and resilience to intermittent very short network failures, growing number of sites and processors makes this architecture, with a single point of failure and scalability limits, obsolete. As a result, a new scheduling architecture is proposed, which relies on higher autonomy of clusters. It is based on a peer to peer network of semi-independent schedulers for each site or even cluster. Each scheduler accepts jobs for the whole infrastructure, cooperating with other schedulers on implementation of global policies like central job accounting, fair-share, or submission of jobs across several sites. The scheduling system is integrated with the Magrathea system to support scheduling of virtual clusters, including the setup of their internal network, again eventually spanning several sites. On the other hand, each scheduler is local to one of several clusters and is able to directly control and submit jobs to them even if the connection of other scheduling peers is lost. In parallel to the change of the overall architecture, the scheduling system itself is being replaced. Instead of PBSPro, chosen originally for its declared support of large scale distributed environment, the new scheduling architecture is based on the open-source Torque system. The implementation and support for the most desired properties in PBSPro and Torque are discussed and the necessary modifications to Torque to support the MetaCentrum scheduling architecture are presented, too.
引用
收藏
页码:105 / 118
页数:14
相关论文
共 50 条
  • [31] Research of peer-to-peer network architecture
    Li, ZP
    Huang, DY
    Liu, IR
    Huang, JH
    [J]. 2003 INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, VOL 1 AND 2, PROCEEDINGS, 2003, : 312 - 315
  • [32] Peer-to-peer mobile network architecture
    Charas, P
    [J]. FIRST INTERNATIONAL CONFERENCE ON PEER-TO-PEER COMPUTING, 2002, : 55 - 61
  • [33] A peer-to-peer infrastructure for resilient Web Services
    Norcross, Stuart J.
    Dearle, Alan
    Kirby, Graham N. C.
    Walker, Scott M.
    [J]. FIRST INTERNATIONAL WORKSHOP ON ADVANCED ARCHITECTURES AND ALGORITHMS FOR INTERNET DELIVERY AND APPLICATIONS, PROCEEDINGS, 2006, : 65 - +
  • [34] DRALIC: A peer-to-peer storage architecture
    He, XB
    Zhang, M
    Yang, Q
    [J]. PDPTA'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, 2001, : 908 - 912
  • [35] A peer-to-peer network positioning architecture
    Onbilger, GK
    Chen, SG
    Chow, R
    [J]. 2004 12TH IEEE INTERNATIONAL CONFERENCE ON NETWORKS, VOLS 1 AND 2 , PROCEEDINGS: UNITY IN DIVERSITY, 2004, : 277 - 283
  • [36] A Peer-to-Peer Secure VoIP Architecture
    Cirani, Simone
    Pecori, Riccardo
    Veltri, Luca
    [J]. TRUSTWORTHY INTERNET, 2011, : 105 - 115
  • [37] A peer-to-peer architecture for mobile communications
    Gonen, EK
    Xu, H
    Joshi, P
    [J]. 2nd International Symposium on Wireless Communications Systems 2005 (ISWCS 2005), 2005, : 293 - 297
  • [38] A peer-to-peer architecture for media streaming
    Tran, DA
    Hua, KA
    Do, TT
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2004, 22 (01) : 121 - 133
  • [39] A PEER-TO-PEER COMMUNICATION INFRASTRUCTURE FOR GROUPWARE APPLICATIONS
    Gotthelf, Pablo
    Zunino, Alejandro
    Campo, Marcelo
    [J]. INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 2008, 17 (04) : 523 - 554
  • [40] Peer-to-peer cooperative caching in mobile environments
    Chow, CY
    Leong, HV
    Chan, A
    [J]. 24TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS WORKSHOPS, PROCEEDINGS, 2004, : 528 - 533