A high availability mechanism for parallel file system

Authors
Zhang, H [1]
Wu, WG
Dong, XS
Qian, DP
Affiliations
[1] Xian Jiaotong Univ, Dept Comp Sci, Xian 710049, Shaanxi, Peoples R China
[2] Beihang Univ, Sch Comp Sci, Beijing 100083, Peoples R China
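Source
ADVANCED PARALLEL PROCESSING TECHNOLOGIES, PROCEEDINGS | 2005 / Vol. 3756
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory and Methods];
Discipline classification code
081202;
Abstract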
Parallel file systems achieve high I/O throughput by dividing a file into multiple blocks and storing them on multiple I/O nodes. However, striping file data over multiple I/O nodes sacrifices the reliability and availability of the parallel file system. In this study, a new mechanism named Logic Mirror Ring (LMR) has been developed to improve the reliability and availability of parallel file systems. A logic mirror ring is built over all I/O nodes to indicate the mirror relationship among the nodes, i.e., each node maintains not only its own data but also the mirror data of other nodes. The fault-tolerance capability of the system is improved because the node maintaining the mirror data of a failed node will take over the requests addressed to the failed node. The mirror depth can be adjusted to different levels based on the reliability and availability requirements. A model is developed to evaluate the reliability and availability of parallel file systems, and the effects of LMR on the reliability and availability of the parallel file system are studied. The results show that LMR can effectively improve the reliability and availability of parallel file systems.
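The mirror relationship and takeover behavior described in the abstract can be illustrated with a minimal Python sketch. The abstract does not specify how mirrors are placed on the ring, so the sketch assumes that a node's data is mirrored on its next `depth` successors; the function names (`mirror_holders`, `route_request`, `all_data_available`) are hypothetical and are not taken from the paper.

```python
from __future__ import annotations

from typing import Optional


def mirror_holders(node: int, n_nodes: int, depth: int) -> list[int]:
    """Return the nodes assumed to hold mirror copies of `node`'s data.

    Assumption (not from the paper): mirrors live on the next `depth`
    successors of `node` on the ring. A depth of 0 means no mirroring.
    """
    if not 0 <= depth < n_nodes:
        raise ValueError("depth must satisfy 0 <= depth < n_nodes")
    return [(node + k) % n_nodes for k in range(1, depth + 1)]


def route_request(target: int, n_nodes: int, depth: int,
                  failed: set[int]) -> Optional[int]:
    """Pick the node that serves a request addressed to `target`.

    A live target serves its own requests. If the target has failed,
    the first live mirror holder takes over. None means every copy
    of the target's data is on a failed node.
    """
    if target not in failed:
        return target
    for holder in mirror_holders(target, n_nodes, depth):
        if holder not in failed:
            return holder
    return None


def all_data_available(n_nodes: int, depth: int, failed: set[int]) -> bool:
    """True if every node's data is still reachable on some live node."""
    return all(
        route_request(node, n_nodes, depth, failed) is not None
        for node in range(n_nodes)
    )


if __name__ == "__main__":
    # Four I/O nodes, mirror depth 1, node 1 has failed.
    print(route_request(1, n_nodes=4, depth=1, failed={1}))
    # Two adjacent failures: depth 1 loses data, depth 2 does not.
    print(all_data_available(4, depth=1, failed={1, 2}))
    print(all_data_available(4, depth=2, failed={1, 2}))
```

Under this assumed placement, a larger mirror depth tolerates longer runs of adjacent failed nodes at the cost of more storage per node, which is consistent with the abstract's statement that the depth can be tuned to the required reliability and availability.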
Pages: 194-203
Page count: 10
Related papers
50 in total
  • [41] A Windows-based parallel file system
    Yeh, Lungpin
    Sun, Juei-Ting
    Hung, Sheng-Kai
    Hsu, Yarsun
    HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, PROCEEDINGS, 2007, 4782 : 7 - 18
  • [42] A flexible multiagent parallel file system for clusters
    Pérez, MS
    Carretero, J
    García, F
    Peña, JM
    Robles, V
    COMPUTATIONAL SCIENCE - ICCS 2003, PT IV, PROCEEDINGS, 2003, 2660 : 248 - 256
  • [43] Techniques for an Energy Aware Parallel File System
    Karakoyunlu, Cengiz
    Chandy, John A.
    2012 INTERNATIONAL GREEN COMPUTING CONFERENCE (IGCC), 2012,
  • [44] Performing Cloud Computation on a Parallel File System
    Wilson, Ellis
    2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 1545 - 1545
  • [45] Performance of the IBM general parallel file system
    Lawrence Livermore Natl Lab, Livermore, United States
    Proceedings of the International Parallel Processing Symposium, IPPS, 2000, : 673 - 681
  • [46] A cluster file system for high data availability using locality-aware partial replication
    Kim, Jinseok
    Sim, Sangman
    Park, Sungyong
    2007 CIT: 7TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY, PROCEEDINGS, 2007, : 345 - 350
  • [47] A Distributed File-Based Storage System for Improving High Availability of Space Weather Data
    Andrian, Yoga
    Kim, Hyeonwoo
    Ju, Hongtaek
    APPLIED SCIENCES-BASEL, 2019, 9 (23):
  • [48] Can Parallel Replication Benefit Hadoop Distributed File System for High Performance Interconnects?
    Islam, Nusrat S.
    Lu, Xiaoyi
    Wasi-ur-Rahman, Md
    Panda, Dhabaleswar K.
    2013 IEEE 21ST ANNUAL SYMPOSIUM ON HIGH-PERFORMANCE INTERCONNECTS (HOTI), 2013, : 75 - 78
  • [49] Leveraging OSD plus devices for implementing a high-throughput parallel file system
    Piernas, Juan
    Gonzalez-Ferez, Pilar
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (21):
  • [50] A high-performance distributed parallel file system for data-intensive computations
    Shen, XH
    Choudhary, A
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2004, 64 (10) : 1157 - 1167