RRBS: A fault tolerance model for cluster/grid parallel file system

被引:0
|
作者
Huo, YM [1 ]
Ju, JB [1 ]
Hu, L [1 ]
机构
[1] Jilin Univ, Dept Comp Sci & Technol, Changchun 130012, Peoples R China
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Parallel file systems stripe the data from a single file across multiple cluster/grid nodes so that the systems can access file in parallel. In such a system, if an I/O node or the storage device of that node doesn't work, all the subfiles on the node can't be accessed. In this paper, we introduce a special fault tolerance model for parallel file systems called Round-robin Redundant Backup of Subfile (RRBS). This model ensures the accessibility of the parallel files even when an I/O node is failure. In order to test the usability of RRBS, we also developed a prototype of parallel file system called WPFS on a PC/Windows cluster.
引用
收藏
页码:180 / 187
页数:8
相关论文
共 50 条
  • [1] A novel cluster parallel file system
    Wei, Wenguo
    Dong, Shoubin
    Zhang, Ling
    Li, Jialin
    SEVENTEENTH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2006, : 119 - +
  • [2] An efficient parallel file system for cluster grids
    Frattolillo, F
    D'Onofrio, S
    RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, PROCEEDINGS, 2005, 3666 : 94 - 101
  • [3] Cooperative caching in the pCFS parallel cluster file system
    Lopes, Paulo A.
    Medeiros, Pedro D.
    HPDC-15: PROCEEDINGS OF THE 15TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, 2005, : 347 - 348
  • [4] Providing PVM with a parallel file system for cluster grids
    Frattolillo, Franco
    D'Onofrio, Salvatore
    WMSCI 2005: 9th World Multi-Conference on Systemics, Cybernetics and Informatics, Vol 3, 2005, : 24 - 29
  • [5] Byzantine fault tolerance in MDS of Grid system
    Wang, Xiu-Qun
    Zhuang, Yue-Ting
    Hou, Hong-Lun
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 2782 - +
  • [6] Fault-tolerance of parallel volume rendering on cluster of PCs
    Guedes, S
    Bentes, C
    da Silva, GP
    Farias, R
    PDPTA '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-3, 2004, : 61 - 66
  • [7] Fault tolerance for cluster-oriented MPI parallel applications
    Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
    Qinghua Daxue Xuebao, 2006, 1 (67-69+110):
  • [8] A parallel and fault tolerant file system based on NFS servers
    García, F
    Calderón, A
    Carretero, J
    Pérez, JM
    Fernández, J
    ELEVENTH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, PROCEEDINGS, 2003, : 83 - 90
  • [9] Job-site level fault tolerance for cluster and grid environments
    Limaye, Kshitij
    Leangsuksun, Box
    Greenwood, Zeno
    Scott, Stephen L.
    Engelmann, Christian
    Libby, Richard
    Chanchio, Kasidit
    2005 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2006, : 95 - 103
  • [10] OmniRPC: A grid RPC system for parallel programming in cluster and grid environment
    Sato, M
    Boku, T
    Takahashi, D
    CCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS, 2003, : 206 - 213