RRBS: A fault tolerance model for cluster/grid parallel file system

被引：0

作者：

Huo, YM ^{[1
]}

Ju, JB ^{[1
]}

Hu, L ^{[1
]}

机构：

[1] Jilin Univ, Dept Comp Sci & Technol, Changchun 130012, Peoples R China

来源：

PARALLEL AND DISTRIBUTED PROCESSING AND APPLICATIONS | 2005年 / 3758卷

关键词：

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Parallel file systems stripe the data from a single file across multiple cluster/grid nodes so that the systems can access file in parallel. In such a system, if an I/O node or the storage device of that node doesn't work, all the subfiles on the node can't be accessed. In this paper, we introduce a special fault tolerance model for parallel file systems called Round-robin Redundant Backup of Subfile (RRBS). This model ensures the accessibility of the parallel files even when an I/O node is failure. In order to test the usability of RRBS, we also developed a prototype of parallel file system called WPFS on a PC/Windows cluster.

引用

页码：180 / 187

页数：8

共 50 条

[1] A novel cluster parallel file system
Wei, Wenguo
Dong, Shoubin
Zhang, Ling
Li, Jialin
SEVENTEENTH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2006, : 119 - +
[2] An efficient parallel file system for cluster grids
Frattolillo, F
D'Onofrio, S
RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, PROCEEDINGS, 2005, 3666 : 94 - 101
[3] Cooperative caching in the pCFS parallel cluster file system
Lopes, Paulo A.
Medeiros, Pedro D.
HPDC-15: PROCEEDINGS OF THE 15TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, 2005, : 347 - 348
[4] Providing PVM with a parallel file system for cluster grids
Frattolillo, Franco
D'Onofrio, Salvatore
WMSCI 2005: 9th World Multi-Conference on Systemics, Cybernetics and Informatics, Vol 3, 2005, : 24 - 29
[5] Byzantine fault tolerance in MDS of Grid system
Wang, Xiu-Qun
Zhuang, Yue-Ting
Hou, Hong-Lun
PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 2782 - +
[6] Fault-tolerance of parallel volume rendering on cluster of PCs
Guedes, S
Bentes, C
da Silva, GP
Farias, R
PDPTA '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-3, 2004, : 61 - 66
[7] Fault tolerance for cluster-oriented MPI parallel applications
Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
Qinghua Daxue Xuebao, 2006, 1 (67-69+110):
[8] A parallel and fault tolerant file system based on NFS servers
García, F
Calderón, A
Carretero, J
Pérez, JM
Fernández, J
ELEVENTH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, PROCEEDINGS, 2003, : 83 - 90
[9] Job-site level fault tolerance for cluster and grid environments
Limaye, Kshitij
Leangsuksun, Box
Greenwood, Zeno
Scott, Stephen L.
Engelmann, Christian
Libby, Richard
Chanchio, Kasidit
2005 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2006, : 95 - 103
[10] OmniRPC: A grid RPC system for parallel programming in cluster and grid environment
Sato, M
Boku, T
Takahashi, D
CCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS, 2003, : 206 - 213

← 1 2 3 4 5 →