A comprehensive model for software rejuvenation

被引:141
|
作者
Vaidyanathan, K [1 ]
Trivedi, KS
机构
[1] Sun Microsyst Inc, Scalable Syst Grp, San Diego, CA 92121 USA
[2] Duke Univ, Dept Elect & Comp Engn, Durham, NC 27708 USA
关键词
availability; measurement-based dependability evaluation; semi-Markov reward models; software aging; software rejuvenation; workload characterization;
D O I
10.1109/TDSC.2005.15
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, the phenomenon of software aging, one in which the state of the software system degrades with time, has been reported. This phenomenon, which may eventually lead to system performance degradation and/or crash/hang failure, is the result of exhaustion of operating system resources, data corruption, and numerical error accumulation. To counteract software aging, a technique called software rejuvenation has been proposed, which essentially involves occasionally terminating an application or a system, cleaning its internal state and/or its environment, and restarting it. Since rejuvenation incurs an overhead, an important research issue is to determine optimal times to initiate this action. In this paper, we first describe how to include faults attributed to software aging in the framework of Gray's software fault classification (deterministic and transient), and study the treatment and recovery strategies for each of the fault classes: We then construct,a semi-Markov reward model based on workload and resource usage data collected from the UNIX operating system. We identify different workload states using statistical cluster analysis, estimate transition probabilities, and sojourn time distributions from the data. Corresponding to each resource, a reward function is then defined for the model based on the rate of resource depletion in each state. The model is then solved to obtain estimated times to exhaustion for each resource. The result from the semi-Markov reward model are then fed into a higher-level availability model that accounts for failure followed by reactive recovery, as well as proactive recovery. This comprehensive model is then used to derive optimal rejuvenation schedules that maximize availability or minimize downtime cost.
引用
收藏
页码:124 / 137
页数:14
相关论文
共 50 条
  • [1] A comprehensive approach to optimal software rejuvenation
    Zhao, Jing
    Wang, YanBin
    Ning, GaoRong
    Trivedi, Kishor S.
    Matias, Rivalino, Jr.
    Cai, Kai-Yuan
    [J]. PERFORMANCE EVALUATION, 2013, 70 (11) : 917 - 933
  • [2] A Comprehensive Model of Android Software Aging and Rejuvenation Considering Battery Saving
    Yakovyna, Vitaliy
    Uhrynovskyi, Bohdan
    Shakhovska, Natalya
    [J]. ELECTRONICS, 2023, 12 (07)
  • [3] Adaptive software rejuvenation: Degradation model and rejuvenation scheme
    Bao, YJ
    Sun, XB
    Trivedi, KS
    [J]. 2003 INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2003, : 241 - 248
  • [4] Software rejuvenation model based on reconfiguration and periodical rejuvenation
    Du, Xiaozhi
    Qi, Yong
    Hou, Di
    Chen, Ying
    [J]. Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2010, 44 (01): : 91 - 95
  • [5] Rejuvenation Action Model for Application Software
    Abidin, Zaiha Nadiah Zainal
    Yahaya, Jamaiah H.
    Deraman, Aziz
    Abdullah, Zuriani Hayati
    [J]. 2018 6TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT), 2018, : 38 - 43
  • [6] A New Software Rejuvenation Model for Android
    Xiang, Jianwen
    Weng, Caisheng
    Zhao, Dongdong
    Tian, Jing
    Xiong, Shengwu
    Li, Lin
    Andrzejak, Artur
    [J]. 2018 29TH IEEE INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS (ISSREW), 2018, : 293 - 299
  • [7] A Comprehensive Evaluation of Software Rejuvenation Policies for Transaction Systems With Markovian Arrivals
    Zheng, Junjun
    Okamura, Hiroyuki
    Li, Lin
    Dohi, Tadashi
    [J]. IEEE TRANSACTIONS ON RELIABILITY, 2017, 66 (04) : 1157 - 1177
  • [8] Comprehensive evaluation of aperiodic checkpointing and rejuvenation schemes in operational software system
    Okamura, Hiroyuki
    Dohi, Tadashi
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2010, 83 (09) : 1591 - 1604
  • [9] A software availability model based on multilevel software rejuvenation and markov chain
    Rahmani Ghobadi, Zahra
    Rashidi, Hassan
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2021, 29 (02) : 730 - 744
  • [10] Software Aging and Software Rejuvenation
    Trivedi, Kishor
    [J]. PROCEEDINGS OF THE 2019 ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING (ICPE '19), 2019, : 1 - 1