TPR: Traffic Pattern-Based Adaptive Routing for Dragonfly Networks

被引:11
|
作者
Faizian, Peyman [1 ]
Alfaro, Juan Francisco [2 ]
Rahman, Md Shafayat [2 ]
Mollah, Md Atiqul [3 ]
Yuan, Xin [2 ]
Pakin, Scott [4 ]
Lang, Michael [4 ]
机构
[1] Univ North Florida, Sch Comp, Jacksonville, FL 32224 USA
[2] Florida State Univ, Dept Comp Sci, Tallahassee, FL 32306 USA
[3] Oakland Univ, Dept Comp Sci & Engn, Rochester, MI 48309 USA
[4] Los Alamos Natl Lab, Dept Comp Computat & Stat Sci, Los Alamos, NM 87545 USA
关键词
Dragonfly topology; cray cascade; traffic pattern-based adaptive routing;
D O I
10.1109/TMSCS.2018.2877264
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Cray Cascade architecture uses Dragonfly as its interconnect topology and employs a globally adaptive routing scheme called UGAL. UGAL directs traffic based on link loads but may make inappropriate adaptive routing decisions in various situations, which degrades its performance. In this work, we propose traffic pattern-based adaptive routing (TPR) for Dragonfly that improves UGAL by incorporating a traffic pattern-based adaptation mechanism. The idea is to explicitly use the link usage statistics that are collected in performance counters to infer the traffic pattern, and to take the inferred traffic pattern plus link loads into consideration when making adaptive routing decisions. Our performance evaluation results on a diverse set of traffic conditions indicate that by incorporating the traffic pattern-based adaptation mechanism, TPR is much more effective in making adaptive routing decisions and achieves significant lower latency under low load and higher throughput under high load than its underlying UGAL.
引用
收藏
页码:931 / 943
页数:13
相关论文
共 50 条
  • [1] Traffic Pattern-based Adaptive Routing for Intra-group Communication in Dragonfly Networks
    Faizian, Peyman
    Rahman, Md Shafayat
    Mollah, Md Atiqul
    Yuan, Xin
    Pakin, Scott
    Lang, Mike
    [J]. 2016 IEEE 24TH ANNUAL SYMPOSIUM ON HIGH-PERFORMANCE INTERCONNECTS (HOTI), 2016, : 19 - 26
  • [2] Analysis of artificial neural networks for pattern-based adaptive control
    Sbarbaro, Daniel
    Johansen, Tor A.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2006, 17 (05): : 1184 - 1193
  • [3] On-the-fly adaptive routing for dragonfly interconnection networks
    Marina García
    Enrique Vallejo
    Ramón Beivide
    Cristóbal Camarero
    Mateo Valero
    Germán Rodríguez
    Cyriel Minkenberg
    [J]. The Journal of Supercomputing, 2015, 71 : 1116 - 1142
  • [4] A Comparative Study of SDN and Adaptive Routing on Dragonfly Networks
    Faizian, Peyman
    Mollah, Md Atiqul
    Tong, Zhou
    Yuan, Xin
    Lang, Michael
    [J]. SC'17: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2017,
  • [5] Adaptive Routing with Hierarchical Reinforcement Learning on Dragonfly Networks
    Cai, Xuhong
    Li, Mo
    Shi, Xingyan
    Shen, Jiayou
    Wu, Chensizhu
    Chen, Yi
    [J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 403 - 409
  • [6] On-the-fly adaptive routing for dragonfly interconnection networks
    Garcia, Marina
    Vallejo, Enrique
    Beivide, Ramon
    Camarero, Cristobal
    Valero, Mateo
    Rodriguez, German
    Minkenberg, Cyriel
    [J]. JOURNAL OF SUPERCOMPUTING, 2015, 71 (03): : 1116 - 1142
  • [7] Fault-Tolerant Adaptive Routing in Dragonfly Networks
    Xiang, Dong
    Li, Bing
    Fu, Yi
    [J]. IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2019, 16 (02) : 259 - 271
  • [8] ADAPTIVE TRAFFIC ROUTING IN TELEPHONE NETWORKS
    BEL, G
    CHEMOUIL, P
    GARCIA, JM
    LEGALL, F
    BERNUSSOU, J
    [J]. LARGE SCALE SYSTEMS IN INFORMATION AND DECISION TECHNOLOGIES, 1985, 8 (03): : 267 - 282
  • [9] Traffic Pattern-Based Content Leakage Detection for Trusted Content Delivery Networks
    Nishiyama, Hiroki
    Fomo, Desmond
    Fadlullah, Zubair Md
    Kato, Nei
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2014, 25 (02) : 301 - 309
  • [10] ACOR: Adaptive congestion-oblivious routing in dragonfly networks
    Benito, M.
    Fuentes, P.
    Vallejo, E.
    Beivide, R.
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 131 : 173 - 188