Optimal Recovery from Large-Scale Failures in IP Networks

被引:7
|
作者
Zheng, Qiang [1 ]
Cao, Guohong [1 ]
La Porta, Tom [1 ]
Swami, Ananthram [2 ]
机构
[1] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
[2] US Army, Res Lab, Adelphi, MD USA
关键词
D O I
10.1109/ICDCS.2012.47
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Quickly recovering IP networks from failures is critical to enhancing Internet robustness and availability. Due to their serious impact on network routing, large-scale failures have received increasing attention in recent years. We propose an approach called Reactive Two-phase Rerouting (RTR) for intra-domain routing to quickly recover from large-scale failures with the shortest recovery paths. To recover a failed routing path, RTR first forwards packets around the failure area to collect information on failures. Then, in the second phase, RTR calculates a new shortest path and forwards packets along it through source routing. RTR can deal with large-scale failures associated with areas of any shape and location, and is free of permanent loops. For any failure area, the recovery paths provided by RTR are guaranteed to be the shortest. Extensive simulations based on ISP topologies show that RTR can find the shortest recovery paths for more than 98.6% of failed routing paths with reachable destinations. Compared with prior works, RTR achieves better performance for recoverable failed routing paths and uses much less network resources for irrecoverable failed routing paths.
引用
收藏
页码:295 / 304
页数:10
相关论文
共 50 条
  • [21] Local floods induce large-scale abrupt failures of road networks
    Wang, Weiping
    Yang, Saini
    Stanley, H. Eugene
    Gao, Jianxi
    NATURE COMMUNICATIONS, 2019, 10 (1)
  • [22] Handling large-scale node failures in mobile sensor/robot networks
    Akkaya, Kemal
    Senturk, Izzet F.
    Vemulapalli, Shanthi
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2013, 36 (01) : 195 - 210
  • [23] Routing solution for VoIP calls in large-scale IP MM networks
    Basic, L
    Vizek, U
    Bolt, V
    Naglic, Z
    Filipovic-Juric, E
    Njezic, Z
    MELECON 2004: PROCEEDINGS OF THE 12TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, VOLS 1-3, 2004, : 673 - 676
  • [24] Configuration and Optimization for Virtualization based Large-scale IP Networks Emulation
    Li Dawei
    2014 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC), 2014, : 277 - 281
  • [25] Performance analysis of large-scale IP networks considering TCP traffic
    Hisamatsu, Hiroyuki
    Hasegawa, Go
    Murata, Masayuki
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2007, E90B (10) : 2845 - 2853
  • [26] Optimal volume anomaly detection and isolation in large-scale IP networks using coarse-grained measurements
    Casas, P.
    Vaton, S.
    Fillatre, L.
    Nikiforov, I.
    COMPUTER NETWORKS, 2010, 54 (11) : 1750 - 1766
  • [27] AUTONOMOUS FAULT DETECTION AND RECOVERY SYSTEM IN LARGE-SCALE NETWORKS
    Memon, Raheel Ahmed
    Li, Jian Ping
    Shah, Fadia
    2016 13TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2016, : 285 - 288
  • [28] netCSI: A Generic Fault Diagnosis Algorithm for Large-Scale Failures in Computer Networks
    Tati, Srikar
    Rager, Scott
    Ko, Bong Jun
    Cao, Guohong
    Swami, Ananthram
    La Porta, Thomas
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2016, 13 (03) : 355 - 368
  • [29] netCSI: A Generic Fault Diagnosis Algorithm for Large-Scale Failures in Computer Networks
    Tati, Srikar
    Rager, Scott
    Ko, Bong Jun
    Cao, Guohong
    Swami, Ananthram
    La Porta, Thomas
    2011 30TH IEEE INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS), 2011, : 167 - 176
  • [30] Understanding Blackholes in Large-Scale Cognitive Radio Networks under Generic Failures
    Sun, Lei
    Wang, Wenye
    2013 PROCEEDINGS IEEE INFOCOM, 2013, : 728 - 736