P4GPU: Acceleration of Programmable Data Plane Using a CPU-GPU Heterogeneous Architecture

Cited by: 0
Authors
Li, Peilong [1]
Luo, Yan [1]
Affiliations
[1] Univ Massachusetts Lowell, Dept Elect & Comp Engn, Lowell, MA 01852 USA
Keywords
Programmable Data Plane; Heterogeneous Architecture; Packet Processing; P4; IP Lookup
DOI
Not available
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline Classification Codes
0808; 0809
Abstract
The programmability of the network data plane has become one of the most desirable features within the context of software-defined networks, with P4 serving as a domain-specific language for defining data plane processing. In this work, we address the challenges of mapping a P4-defined data plane to a heterogeneous programmable hardware architecture consisting of both a CPU and a GPU, the latter offering a parallel SIMD architecture well suited to processing network flows. We first design a toolset that can be used to map a P4 program onto the proposed architecture. We then optimize the GPU kernel designs for "match-action" primitives and present latency-hiding techniques to reduce the overheads of CPU/GPU communication. In addition, load balancing is investigated to maximize the utilization of CPU and GPU resources. Our toolset and optimizations allow a P4 program to achieve promising performance on the given heterogeneous architecture. Specifically, the experimental results collected on our prototype systems show that the automatically configured GPU kernels achieve scalable lookup and classification speeds, with 420 million IP lookups per second and more than 60 million classifications per second (for 4K firewall rules).
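For readers unfamiliar with the "match-action" primitive mentioned in the abstract, the following is a minimal CPU-side sketch of one common instance, IPv4 longest-prefix-match lookup over a batch of addresses. It is an illustration only and not the authors' GPU kernel or toolset: the function names (`build_table`, `lookup`, `lookup_batch`), the route entries, and the port labels are all hypothetical, and the per-length hash table is an assumed data structure chosen for brevity.

```python
import ipaddress

def build_table(routes):
    """Group (prefix, next_hop) routes by prefix length (hypothetical layout)."""
    table = {}
    for prefix, next_hop in routes:
        net = ipaddress.ip_network(prefix)
        table.setdefault(net.prefixlen, {})[int(net.network_address)] = next_hop
    return table

def lookup(table, addr, default="drop"):
    """Match: longest IPv4 prefix covering addr. Action: return its next hop."""
    ip = int(ipaddress.ip_address(addr))
    # Try the longest prefixes first so the most specific route wins.
    for length in sorted(table, reverse=True):
        mask = (0xFFFFFFFF << (32 - length)) & 0xFFFFFFFF
        hit = table[length].get(ip & mask)
        if hit is not None:
            return hit
    return default

def lookup_batch(table, addrs):
    """Each lookup is independent, which is what makes batches easy to parallelize."""
    return [lookup(table, a) for a in addrs]

# Illustrative routes and port names; none of these come from the paper.
ROUTES = [("10.0.0.0/8", "port1"), ("10.1.0.0/16", "port2"), ("0.0.0.0/0", "port0")]
TABLE = build_table(ROUTES)
RESULT = lookup_batch(TABLE, ["10.1.2.3", "10.2.3.4", "8.8.8.8"])
```

Because every address in the batch is resolved independently, a loop of this shape could in principle be assigned one packet per thread on a SIMD device; how the paper's generated kernels actually organize the work is not stated in this record.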
Pages: 168-175
Page count: 8
Related Papers
50 in total
  • [21] A Survey on Task Scheduling of CPU-GPU Heterogeneous Cluster
    ZHOU Yiheng
    ZENG Wei
    ZHENG Qingfang
    LIU Zhilong
    CHEN Jianping
    ZTE Communications, 2024, 22 (03) : 83 - 90
  • [22] Image Noise Removal on Heterogeneous CPU-GPU Configurations
    Sanchez, Maria G.
    Vidal, Vicente
    Arnal, Josep
    Vidal, Anna
    2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2014, 29 : 2219 - 2229
  • [23] Analyzing OpenCL 2.0 Workloads Using a Heterogeneous CPU-GPU Simulator
    Wang, Li
    Tsai, Ren-Wei
    Wang, Shao-Chung
    Chen, Kun-Chih
    Wang, Po-Han
    Cheng, Hsiang-Yun
    Lee, Yi-Chung
    Shu, Sheng-Jie
    Yang, Chun-Chieh
    Hsu, Min-Yih
    Kan, Li-Chen
    Lee, Chao-Lin
    Yu, Tzu-Chieh
    Peng, Rih-Ding
    Yang, Chia-Lin
    Hwang, Yuan-Shin
    Lee, Jenq-Kuen
    Tsao, Shiao-Li
    Ouhyoung, Ming
    2017 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2017, : 127 - 128
  • [24] CPU-GPU heterogeneous code acceleration of a finite volume Computational Fluid Dynamics solver
    Xue, Weicheng
    Wang, Hongyu
    Roy, Christopher J.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 158 : 367 - 377
  • [25] Performance Optimization for CPU-GPU Heterogeneous Parallel System
    Wang, Yanhua
    Qiao, Jianzhong
    Lin, Shukuan
    Zhao, Tinglei
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1259 - 1266
  • [26] Efficient Pattern Matching on CPU-GPU Heterogeneous Systems
    Sanz, Victoria
    Pousa, Adrian
    Naiouf, Marcelo
    De Giusti, Armando
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING (ICA3PP 2019), PT I, 2020, 11944 : 391 - 403
  • [27] Heterogeneous parallel_for Template for CPU-GPU Chips
    Navarro, Angeles
    Corbera, Francisco
    Rodriguez, Andres
    Vilches, Antonio
    Asenjo, Rafael
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 47 (02) : 213 - 233
  • [28] A Flexible Scheduling Framework for Heterogeneous CPU-GPU Clusters
    Sajjapongse, Kittisak
    Agarwal, Tejaswi
    Becchi, Michela
    2014 21ST INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2014,
  • [29] Efficient Matrix Factorization on Heterogeneous CPU-GPU Systems
    Yu, Yuanhang
    Wen, Dong
    Zhang, Ying
    Wang, Xiaoyang
    Zhang, Wenjie
    Lin, Xuemin
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 1871 - 1876
  • [30] Accelerating Static Timing Analysis Using CPU-GPU Heterogeneous Parallelism
    Guo, Zizheng
    Huang, Tsung-Wei
    Lin, Yibo
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 42 (12) : 4973 - 4984