NB-Cache: Non-Blocking In-Network Caching for High-Performance Content Routers

被引:1
|
作者
Pan, Tian [1 ]
Lin, Xingchen [1 ]
Song, Enge [1 ]
Xu, Cheng [1 ]
Zhang, Jiao [1 ]
Li, Hao [2 ]
Lv, Jianhui [3 ]
Huang, Tao [1 ]
Liu, Bin [4 ]
Zhang, Beichuan [5 ]
机构
[1] Beijing Univ Posts & Telecommun BUPT, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
[2] Xi An Jiao Tong Univ, Dept Comp Sci & Technol, Xian 710049, Peoples R China
[3] Tsinghua Univ, Int Grad Sch Shenzhen, Shenzhen 518057, Peoples R China
[4] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[5] Univ Arizona, Dept Comp Sci, Tucson, AZ 85721 USA
基金
中国国家自然科学基金;
关键词
Pipelines; Load modeling; Data models; Throughput; Queueing analysis; Prototypes; Markov processes; ICN; content router; bottleneck bypassing; non-blocking I; O; Bloom filter; N-queue bypass model; TANDEM QUEUE;
D O I
10.1109/TNET.2021.3083599
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Information-Centric Networking (ICN) provides scalable and efficient content distribution at the Internet scale due to in-network caching and native multicast. To support these features, a content router needs high performance at its data plane, which consists of three forwarding steps: checking the Content Store (CS), then the Pending Interest Table (PIT), and finally the Forwarding Information Base (FIB). In this work, we build an analytical model of the router and identify that CS is the actual bottleneck. Then, we propose a novel mechanism called "NB-Cache" to address CS's performance issue from a network-wide point of view. In NB-Cache, when packets arrive at a router whose CS is fully loaded, instead of being blocked and waiting for the CS, these packets are forwarded to the next-hop router, whose CS may not be fully loaded. This approach essentially utilizes Content Stores of all the routers along the forwarding path in parallel rather than checking each CS sequentially. NB-Cache follows a design pattern of on-demand load balancing and can be formulated into a non-trivial N-queue bypass model. We use the Markov chain to establish its theoretical base and find an algorithm for automated transition rate matrix generation. Experiments show significant improvement of data plane performance: 70% reduction in round-trip time (RTT) and 130% increase in throughput. NB-Cache decouples the fast packet forwarding from the slower content retrieval thus substantially reducing CS's heavy dependency on fast but expensive memory.
引用
收藏
页码:1976 / 1989
页数:14
相关论文
共 50 条
  • [21] Combining Popularity and Locality to Enhance In-Network Caching Performance and Mitigate Pollution Attacks in Content-Centric Networking
    Zhang, Guozhi
    Liu, Jiqiang
    Chang, Xiaolin
    Chen, Zhi
    [J]. IEEE ACCESS, 2017, 5 : 19012 - 19022
  • [22] BP-NUCA: CACHE PRESSURE-AWARE MIGRATION FOR HIGH-PERFORMANCE CACHING IN CMPS
    Jia, Xiaomin
    Jiang, Jiang
    Wang, Yongwen
    Qi, Shubo
    Zhao, Tianlei
    Fu, Guitao
    Zhang, Minxuan
    [J]. COMPUTING AND INFORMATICS, 2011, 30 (05) : 1037 - 1060
  • [23] Scalable and Flexible High-Performance In-Network Processing of Hash Joins in Distributed Databases
    Wirth, Johannes
    Hofmann, Jaco A.
    Thostrup, Lasse
    Binnig, Carsten
    Koch, Andreas
    [J]. 2021 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT), 2021, : 212 - 220
  • [24] Using Non-blocking I/O Operations in High Performance Computing to Reduce Execution Times
    Buettner, David
    Kunkel, Julian
    Ludwig, Thomas
    [J]. RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, PROCEEDINGS, 2009, 5759 : 134 - 142
  • [25] High-performance implementation of in-network traffic pacing for small-buffer networks
    Hanay, Y. Sinan
    Dwaraki, Abhishek
    Hu, Kekai
    Wolf, Tilman
    [J]. COMPUTER COMMUNICATIONS, 2013, 36 (13) : 1450 - 1459
  • [26] High-Performance Hybrid Key-Value Store on Modern Clusters with RDMA Interconnects and SSDs: Non-blocking Extensions, Designs, and Benefits
    Shankar, Dipti
    Lu, Xiaoyi
    Islam, Nusrat
    Wasi-ur-Rahman, Md.
    Panda, Dhabaleswar K.
    [J]. 2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, : 393 - 402
  • [27] High-performance and scalable non-blocking all-to-all with collective offload on InfiniBand clusters: a study with parallel 3D FFT
    Kandalla, Krishna
    Subramoni, Hari
    Tomko, Karen
    Pekurovsky, Dmitry
    Sur, Sayantan
    Panda, Dhabaleswar K.
    [J]. COMPUTER SCIENCE-RESEARCH AND DEVELOPMENT, 2011, 26 (3-4): : 237 - 246
  • [28] A Novel Functional Partitioning Approach to Design High-Performance MPI-3 Non-Blocking Alltoallv Collective on Multi-core Systems
    Kandalla, K.
    Subramoni, H.
    Tomko, K.
    Pekurovsky, D.
    Panda, D. K.
    [J]. 2013 42ND ANNUAL INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2013, : 611 - 620
  • [29] High Performance and Enhanced Scalability for Parallel Applications using MPI-3's non-blocking Collectives
    Pericherla, Surendra Varma
    Vadhiyar, Sathish
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017), 2017, 108 : 2403 - 2407
  • [30] A RISC-V in-network accelerator for flexible high-performance low-power packet processing
    Di Girolamo, Salvatore
    Kurth, Andreas
    Calotoiu, Alexandru
    Benz, Thomas
    Schneider, Timo
    Beranek, Jakub
    Benini, Luca
    Hoefler, Torsten
    [J]. 2021 ACM/IEEE 48TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2021), 2021, : 958 - 971