NB-Cache: Non-Blocking In-Network Caching for High-Performance Content Routers

Cited by: 1

Authors:

Pan, Tian [1]

Lin, Xingchen [1]

Song, Enge [1]

Xu, Cheng [1]

Zhang, Jiao [1]

Li, Hao [2]

Lv, Jianhui [3]

Huang, Tao [1]

Liu, Bin [4]

Zhang, Beichuan [5]

Affiliations:

[1] Beijing Univ Posts & Telecommun BUPT, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China

[2] Xi An Jiao Tong Univ, Dept Comp Sci & Technol, Xian 710049, Peoples R China

[3] Tsinghua Univ, Int Grad Sch Shenzhen, Shenzhen 518057, Peoples R China

[4] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China

[5] Univ Arizona, Dept Comp Sci, Tucson, AZ 85721 USA

Source:

IEEE-ACM TRANSACTIONS ON NETWORKING | 2021 / Vol. 29 / No. 05

Funding:

National Natural Science Foundation of China;

Keywords:

Pipelines; Load modeling; Data models; Throughput; Queueing analysis; Prototypes; Markov processes; ICN; content router; bottleneck bypassing; non-blocking I/O; Bloom filter; N-queue bypass model; TANDEM QUEUE;

DOI:

10.1109/TNET.2021.3083599

Chinese Library Classification:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Information-Centric Networking (ICN) provides scalable and efficient content distribution at the Internet scale due to in-network caching and native multicast. To support these features, a content router needs high performance at its data plane, which consists of three forwarding steps: checking the Content Store (CS), then the Pending Interest Table (PIT), and finally the Forwarding Information Base (FIB). In this work, we build an analytical model of the router and identify that CS is the actual bottleneck. Then, we propose a novel mechanism called "NB-Cache" to address CS's performance issue from a network-wide point of view. In NB-Cache, when packets arrive at a router whose CS is fully loaded, instead of being blocked and waiting for the CS, these packets are forwarded to the next-hop router, whose CS may not be fully loaded. This approach essentially utilizes Content Stores of all the routers along the forwarding path in parallel rather than checking each CS sequentially. NB-Cache follows a design pattern of on-demand load balancing and can be formulated into a non-trivial N-queue bypass model. We use the Markov chain to establish its theoretical base and find an algorithm for automated transition rate matrix generation. Experiments show significant improvement of data plane performance: 70% reduction in round-trip time (RTT) and 130% increase in throughput. NB-Cache decouples the fast packet forwarding from the slower content retrieval thus substantially reducing CS's heavy dependency on fast but expensive memory.
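The bypass idea described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the `Router` class, the `cs_capacity` bound, the `dispatch` function, and the Interest names are all hypothetical, and only the Content Store lookup queue is modeled (PIT and FIB processing are omitted). An Interest that finds a fully loaded CS is passed to the next hop instead of waiting.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Router:
    """Toy content router; only the Content Store (CS) lookup queue is modeled."""
    name: str
    cs_capacity: int  # hypothetical bound on pending CS lookups
    cs_queue: deque = field(default_factory=deque)

    def cs_is_full(self) -> bool:
        return len(self.cs_queue) >= self.cs_capacity


def dispatch(path, interest):
    """Return the name of the first router on the path whose CS accepts the
    Interest, or None if every CS on the path is fully loaded."""
    for router in path:
        if router.cs_is_full():
            # Non-blocking step: skip this CS and try the next-hop router.
            continue
        router.cs_queue.append(interest)
        return router.name
    # Assumption: what happens past the last router is left out of this sketch.
    return None


# Two routers on the forwarding path, each able to queue two CS lookups.
path = [Router("r1", cs_capacity=2), Router("r2", cs_capacity=2)]
placements = [dispatch(path, f"/video/seg{i}") for i in range(5)]
print(placements)  # ['r1', 'r1', 'r2', 'r2', None]
```

In this sketch nothing is ever dequeued, so the fifth Interest finds both stores full; a fuller model would drain each queue at the CS service rate, which is where a queueing analysis such as the N-queue bypass model would come in.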

Citation

Pages: 1976-1989

Page count: 14

50 items in total

[21] Combining Popularity and Locality to Enhance In-Network Caching Performance and Mitigate Pollution Attacks in Content-Centric Networking
Zhang, Guozhi
Liu, Jiqiang
Chang, Xiaolin
Chen, Zhi
[J]. IEEE ACCESS, 2017, 5 : 19012 - 19022
[22] BP-NUCA: CACHE PRESSURE-AWARE MIGRATION FOR HIGH-PERFORMANCE CACHING IN CMPS
Jia, Xiaomin
Jiang, Jiang
Wang, Yongwen
Qi, Shubo
Zhao, Tianlei
Fu, Guitao
Zhang, Minxuan
[J]. COMPUTING AND INFORMATICS, 2011, 30 (05) : 1037 - 1060
[23] Scalable and Flexible High-Performance In-Network Processing of Hash Joins in Distributed Databases
Wirth, Johannes
Hofmann, Jaco A.
Thostrup, Lasse
Binnig, Carsten
Koch, Andreas
[J]. 2021 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT), 2021, : 212 - 220
[24] Using Non-blocking I/O Operations in High Performance Computing to Reduce Execution Times
Buettner, David
Kunkel, Julian
Ludwig, Thomas
[J]. RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, PROCEEDINGS, 2009, 5759 : 134 - 142
[25] High-performance implementation of in-network traffic pacing for small-buffer networks
Hanay, Y. Sinan
Dwaraki, Abhishek
Hu, Kekai
Wolf, Tilman
[J]. COMPUTER COMMUNICATIONS, 2013, 36 (13) : 1450 - 1459
[26] High-Performance Hybrid Key-Value Store on Modern Clusters with RDMA Interconnects and SSDs: Non-blocking Extensions, Designs, and Benefits
Shankar, Dipti
Lu, Xiaoyi
Islam, Nusrat
Wasi-ur-Rahman, Md.
Panda, Dhabaleswar K.
[J]. 2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, : 393 - 402
[27] High-performance and scalable non-blocking all-to-all with collective offload on InfiniBand clusters: a study with parallel 3D FFT
Kandalla, Krishna
Subramoni, Hari
Tomko, Karen
Pekurovsky, Dmitry
Sur, Sayantan
Panda, Dhabaleswar K.
[J]. COMPUTER SCIENCE-RESEARCH AND DEVELOPMENT, 2011, 26 (3-4) : 237 - 246
[28] A Novel Functional Partitioning Approach to Design High-Performance MPI-3 Non-Blocking Alltoallv Collective on Multi-core Systems
Kandalla, K.
Subramoni, H.
Tomko, K.
Pekurovsky, D.
Panda, D. K.
[J]. 2013 42ND ANNUAL INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2013, : 611 - 620
[29] High Performance and Enhanced Scalability for Parallel Applications using MPI-3's non-blocking Collectives
Pericherla, Surendra Varma
Vadhiyar, Sathish
[J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017), 2017, 108 : 2403 - 2407
[30] A RISC-V in-network accelerator for flexible high-performance low-power packet processing
Di Girolamo, Salvatore
Kurth, Andreas
Calotoiu, Alexandru
Benz, Thomas
Schneider, Timo
Beranek, Jakub
Benini, Luca
Hoefler, Torsten
[J]. 2021 ACM/IEEE 48TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2021), 2021, : 958 - 971
