Fast and Flexible: Parallel Packet Processing with GPUs and Click

被引:0
|
作者
Sun, Weibin [1 ]
Ricci, Robert [1 ]
机构
[1] Univ Utah, Flux Res Grp, Sch Comp, Salt Lake City, UT 84112 USA
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We introduce Snap, a framework for packet processing that outperforms traditional software routers by exploiting the parallelism available on modern GPUs. While obtaining high performance, it remains extremely flexible, with packet processing tasks implemented as simple modular elements that are composed to build fully functional routers and switches. Snap is based on the Click modular router, which it extends by adding new architectural features that support batched packet processing, memory structures optimized for offloading to coprocessors, and asynchronous scheduling with in-order completion. We show that Snap can run complex pipelines at high speeds on commodity PC hardware by building an IP router incorporating both an IDS-like full-packet string matcher and an SDN-like packet classifier. In this configuration, Snap is able to forward 40 million packets per second, saturating four 10 Gbps NICs at packet sizes as small as 128 byes. This represents an increase in throughput of nearly 4x over the baseline Click running comparable elements on the CPU.
引用
收藏
页码:25 / 35
页数:11
相关论文
共 50 条
  • [21] TupleMerge: Fast Software Packet Processing for Online Packet Classification
    Daly, James
    Bruschi, Valerio
    Linguaglossa, Leonardo
    Pontarelli, Salvatore
    Rossi, Dario
    Tollet, Jerome
    Torng, Eric
    Yourtchenko, Andrew
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2019, 27 (04) : 1417 - 1431
  • [22] Efficient hierarchical hash tree for OpenFlow packet classification with fast updates on GPUs
    Lin, Yu-Hsiang
    Shih, Wen-Chi
    Chang, Yeim-Kuan
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2022, 167 : 136 - 147
  • [23] Fast evolutionary image processing using multi-GPUs
    Ando, Jun
    Nagao, Tomoharu
    2007 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-8, 2007, : 1518 - +
  • [24] Fast evaluation of Helmholtz potential on graphics processing units (GPUs)
    Li, Shaojing
    Livshitz, Boris
    Lomakin, Vitaliy
    JOURNAL OF COMPUTATIONAL PHYSICS, 2010, 229 (22) : 8463 - 8483
  • [25] An efficient parallel-network packet pattern-matching approach using GPUs
    Hung, Che-Lun
    Lin, Chun-Yuan
    Wang, Hsiao-Hsi
    JOURNAL OF SYSTEMS ARCHITECTURE, 2014, 60 (05) : 431 - 439
  • [26] Parallel Fast Walsh Transform Algorithm and Its Implementation with CUDA on GPUs
    Bikov, Dusan
    Bouyukliev, Iliya
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2018, 18 (05) : 21 - 43
  • [27] Accelerated CDOCKER with GPUs, Parallel Simulated Annealing, and Fast Fourier Transforms
    Ding, Xinqiang
    Wu, Yujin
    Wang, Yanming
    Vilseck, Jonah Z.
    Brooks, Charles L., III
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2020, 16 (06) : 3910 - 3919
  • [28] Parallel Selectivity Estimation for Optimizing Multidimensional Spatial Join Processing on GPUs
    Zhang, Jianting
    You, Simin
    Gruenwald, Le
    2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 1591 - 1598
  • [29] FlowContext: Flexible platform for multigigabit stateful packet processing
    Kosek, Martin
    Korenek, Jan
    2007 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2, 2007, : 804 - 807
  • [30] A framework for flexible packet processing in heterogeneous sensor networks
    Leogrande, Marco
    Pastrone, Claudio
    Spirito, Maurizio
    Tornasi, Riccardo
    PROCEEDINGS OF FUTURE GENERATION COMMUNICATION AND NETWORKING, WORKSHOP PAPERS, VOL 2, 2007, : 377 - +