A Distributed Fault-Tolerant Design for Multiple-Server VOD Systems

被引:0
|
作者
Ing-Jye Shyu
Shiuh-Pyng Shieh
机构
[1] National Chiao-Tung University,Department of Computer Science and Information Engineering
来源
关键词
fault tolerance; fault recovery; distributed algorithms; multimedia systems;
D O I
暂无
中图分类号
学科分类号
摘要
Fault tolerance is an important design criterion for reliable and robust video-on-demand systems. Conventional fault-tolerant designs use either a primary backup or an active replication method to provide system fault tolerance. However, these approaches suffer from low utilization of the backup or replication system. In this paper we propose two playback-recovery schemes for distributed video-on-demand systems called the forward playback-recovery scheme and the backward playback-recovery scheme. Unlike conventional fault-tolerant designs, our schemes use existing playback resources to recover faulty playbacks without allocating new resources, significantly reducing recovery overhead. To use the schemes effectively, we developed a distributed algorithm for determining the order and gap information between the playbacks on the distributed video-on-demand servers so that overhead for recovering from a server failure can be minimized. This algorithm achieves N − 1 fault-tolerant resiliency for N-server video-on-demand systems. In addition, three server-recovery policies are also presented to guide surviving servers in applying the proper scheme to recover faulty playbacks, thus reducing overall recovery costs. Simulation results show that the proposed recovery schemes are effective and useful in designing fault-tolerant multiple-server video-on-demand systems.
引用
收藏
页码:219 / 247
页数:28
相关论文
共 50 条
  • [31] Evaluation of fault-tolerant distributed web systems
    Hong, YS
    No, JH
    Han, I
    WORDS 2005: 10th IEEE International Workshop on Object-Oriented Real-Time Dependable, Proceedings, 2005, : 148 - 151
  • [32] COMMUNICATIONS IN DISTRIBUTED FAULT-TOLERANT COMPUTING SYSTEMS
    MORGANTI, M
    JOURNAL OF SYSTEMS AND SOFTWARE, 1986, 6 (1-2) : 213 - 216
  • [33] DISTRIBUTED FAULT-TOLERANT COMPUTER-SYSTEMS
    RENNELS, DA
    COMPUTER, 1980, 13 (03) : 55 - 65
  • [34] FAULT-TOLERANT CLOCK SYNCHRONIZATION IN DISTRIBUTED SYSTEMS
    RAMANATHAN, P
    SHIN, KG
    BUTLER, RW
    COMPUTER, 1990, 23 (10) : 33 - 42
  • [35] TASK ALLOCATION IN FAULT-TOLERANT DISTRIBUTED SYSTEMS
    BANNISTER, JA
    TRIVEDI, KS
    ACTA INFORMATICA, 1983, 20 (03) : 261 - 281
  • [36] COMMUNICATION STRUCTURES IN FAULT-TOLERANT DISTRIBUTED SYSTEMS
    PRADHAN, DK
    MEYER, FJ
    NETWORKS, 1993, 23 (04) : 379 - 389
  • [37] Fault-tolerant teleoperation systems design
    Dede, Mehmet
    Tosunoglu, Sabri
    INDUSTRIAL ROBOT-AN INTERNATIONAL JOURNAL, 2006, 33 (05) : 365 - 372
  • [38] Distributed Fault Estimation and Fault-Tolerant Control of Interconnected Systems
    Zhang, Ke
    Jiang, Bin
    Chen, Mou
    Yan, Xing-Gang
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (03) : 1230 - 1240
  • [39] OPTIMAL-DESIGN OF FAULT-TOLERANT DISTRIBUTED SYSTEMS BASED ON A RECURSIVE ALGORITHM
    PHAM, H
    UPADHYAYA, SJ
    IEEE TRANSACTIONS ON RELIABILITY, 1991, 40 (03) : 375 - 379
  • [40] Design of Fault-Tolerant Distributed Cyber-Physical Systems for Smart Environments
    Cassano, Luca
    Miele, Antonio
    Mione, Francesco
    Tonellotto, Nicola
    Vallati, Carlo
    IEEE EMBEDDED SYSTEMS LETTERS, 2022, 14 (02) : 79 - 82