High-Availability Computing Platform with Sensor Fault Resilience

被引:3
|
作者
Lee, Yen-Lin [1 ]
Arizky, Shinta Nuraisya [1 ]
Chen, Yu-Ren [1 ,2 ]
Liang, Deron [1 ]
Wang, Wei-Jen [1 ]
机构
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taoyuan 320, Taiwan
[2] Inst Informat Ind, Taipei 106, Taiwan
关键词
failover; high availability; sensor fault; fault detection and recovery; liveness detection;
D O I
10.3390/s21020542
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Modern computing platforms usually use multiple sensors to report system information. In order to achieve high availability (HA) for the platform, the sensors can be used to efficiently detect system faults that make a cloud service not live. However, a sensor may fail and disable HA protection. In this case, human intervention is needed, either to change the original fault model or to fix the sensor fault. Therefore, this study proposes an HA mechanism that can continuously provide HA to a cloud system based on dynamic fault model reconstruction. We have implemented the proposed HA mechanism on a four-layer OpenStack cloud system and tested the performance of the proposed mechanism for all possible sets of sensor faults. For each fault model, we inject possible system faults and measure the average fault detection time. The experimental result shows that the proposed mechanism can accurately detect and recover an injected system fault with disabled sensors. In addition, the system fault detection time increases as the number of sensor faults increases, until the HA mechanism is degraded to a one-system-fault model, which is the worst case as the system layer heartbeating.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [21] Using Thermal-Aware VM Migration Mechanism for High-Availability Cloud Computing
    Chen, Ying-Jun
    Horng, Gwo-Jiun
    Li, Jian-Hua
    Cheng, Sheng-Tzong
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2017, 97 (01) : 1475 - 1502
  • [22] ACHIEVING HIGH-AVAILABILITY BATCH CONTROL
    LENGYEL, L
    [J]. INTECH, 1988, 35 (08) : 47 - 48
  • [23] Dual-uCPE for High-Availability Retailer Services with Fault Toleranceand Load Balancing
    Lin, Ying-Dar
    Ho, Kuan-Yu
    Yahya, Widhi
    Li, Chi-Yu
    Lai, Yuan-Cheng
    Tseng, Jeans H.
    [J]. IEEE Internet of Things Magazine, 2023, 6 (04): : 88 - 95
  • [24] Death by Babble: Security and Fault Tolerance of Distributed Consensus in High-Availability Softwarized Networks
    Hanmer, Robert
    Liu, Sheng
    Jagadeesan, Lalita
    Rahman, Muntasir Raihan
    [J]. PROCEEDINGS OF THE 2019 IEEE CONFERENCE ON NETWORK SOFTWARIZATION (NETSOFT 2019), 2019, : 266 - 270
  • [25] Research on High-Availability of Softswitch System
    LOU Zhi-qiang1
    2.School of Telecommunication Engineering
    [J]. The Journal of China Universities of Posts and Telecommunications, 2006, (02) : 50 - 53
  • [26] HIGH-AVAILABILITY COMPUTER-SYSTEMS
    GRAY, J
    SIEWIOREK, DP
    [J]. COMPUTER, 1991, 24 (09) : 39 - 48
  • [27] High-Availability Service Chain Realization Theory
    Sharma, Sidharth
    Gumaste, Ashwin
    Tatipamula, Mallik
    [J]. 2020 16TH INTERNATIONAL CONFERENCE ON THE DESIGN OF RELIABLE COMMUNICATION NETWORKS DRCN 2020, 2020,
  • [28] Research on High-Availability Based on Architecture of ForCES
    Li, Qun
    Dong, Ligang
    Gao, Ming
    [J]. 2009 ASIA-PACIFIC CONFERENCE ON INFORMATION PROCESSING (APCIP 2009), VOL 2, PROCEEDINGS, 2009, : 537 - 540
  • [29] Sustaining High-Availability and Quality of Web Services
    Lim, Erbin
    Thiran, Philippe
    [J]. CURRENT TRENDS IN WEB ENGINEERING, 2010, 6385s : 560 - 565
  • [30] A High-availability Data Backup Strategy for IPFS
    Shi, LinFei
    Luo, Hong
    Yang, XueMei
    Sun, Yan
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2019,