Dagger: Efficient and Fast RPCs in Cloud Microservices with Near-Memory Reconfigurable NICs

被引:28
|
作者
Lazarev, Nikita [1 ]
Xiang, Shaojie [1 ]
Adit, Neil [1 ]
Zhang, Zhiru [1 ]
Delimitrou, Christina [1 ]
机构
[1] Cornell Univ, Ithaca, NY 14853 USA
关键词
End-host networking; cloud computing; datacenters; RPC frameworks; microservices; smartNICs; FPGAs; cache-coherent FPGAs;
D O I
10.1145/3445814.3446696
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The ongoing shift of cloud services from monolithic designs to microservices creates high demand for efficient and high performance datacenter networking stacks, optimized for fine-grained workloads. Commodity networking systems based on software stacks and peripheral NICs introduce high overheads when it comes to delivering small messages. We present Dagger, a hardware acceleration fabric for cloud RPCs based on FPGAs, where the accelerator is closely-coupled with the host processor over a configurable memory interconnect. The three key design principle of Dagger are: (1) offloading the entire RPC stack to an FPGA-based NIC, (2) leveraging memory interconnects instead of PCIe buses as the interface with the host CPU, and (3) making the acceleration fabric reconfigurable, so it can accommodate the diverse needs of microservices. We show that the combination of these principles significantly improves the efficiency and performance of cloud RPC systems while preserving their generality. Dagger achieves 1.3 - 3.8x higher per-core RPC throughput compared to both highly-optimized software stacks, and systems using specialized RDMA adapters. It also scales up to 84 Mrps with 8 threads on 4 CPU cores, while maintaining state-of-the-art mu s-scale tail latency. We also demonstrate that large third-party applications, like memcached and MICA KVS, can be easily ported on Dagger with minimal changes to their codebase, bringing their median and tail KVS access latency down to 2.8 - 3.5 us and 5.4 - 7.8 us, respectively. Finally, we show that Dagger is beneficial for multi-tier end-to-end microservices with different threading models by evaluating it using an 8-tier application implementing a flight check-in service.
引用
收藏
页码:36 / 51
页数:16
相关论文
共 28 条
  • [1] Dagger: Towards Efficient RPCs in Cloud Microservices With Near-Memory Reconfigurable NICs
    Lazarev, Nikita
    Adit, Neil
    Xiang, Shaojie
    Zhang, Zhiru
    Delimitrou, Christina
    IEEE COMPUTER ARCHITECTURE LETTERS, 2020, 19 (02) : 134 - 138
  • [2] Performance Estimation and Prototyping of Reconfigurable Near-Memory Computing Systems
    Iskandar, Veronia
    Abd El Ghany, Mohamed A.
    Goehringer, Diana
    2023 33RD INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL, 2023, : 357 - 358
  • [3] Accelerating Weather Prediction Using Near-Memory Reconfigurable Fabric
    Singh, Gagandeep
    Diamantopoulos, Dionysios
    Gomez-Luna, Juan
    Hagleitner, Christoph
    Stuijk, Sander
    Corporaal, Henk
    Mutlu, Onur
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2022, 15 (04)
  • [4] GEM: Ultra-Efficient Near-Memory Reconfigurable Acceleration for Read Mapping by Dividing and Predictive Scattering
    Chen, Longlong
    Zhu, Jianfeng
    Peng, Guiqiang
    Liu, Mingxu
    Wei, Shaojun
    Liu, Leibo
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (12) : 3059 - 3072
  • [5] HeNCoG: A Heterogeneous Near-memory Computing Architecture for Energy Efficient GCN Acceleration
    Hwang, Seung-Eon
    Song, Duyeong
    Park, Jongsun
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [6] Recryptor: A Reconfigurable Cryptographic Cortex-MO Processor With In-Memory and Near-Memory Computing for IoT Security
    Zhang, Yiqun
    Xu, Li
    Dong, Qing
    Wang, Jingcheng
    Blaauw, David
    Sylvester, Dennis
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2018, 53 (04) : 995 - 1005
  • [7] Leakage Reuse for Energy Efficient Near-Memory Computing of Heterogeneous DNN Accelerators
    Hossain, Md Shazzad
    Savidis, Ioannis
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2021, 11 (04) : 762 - 775
  • [8] NEMO-CNN: An Efficient Near-Memory Accelerator for Convolutional Neural Networks
    Brown, Grant
    Tenace, Valerio
    Gaillardon, Pierre-Emmanuel
    2021 IEEE 32ND INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2021), 2021, : 57 - 60
  • [9] FAFNIR: Accelerating Sparse Gathering by Using Efficient Near-Memory Intelligent Reduction
    Asgari, Bahar
    Hadidi, Ramyad
    Cao, Jiashen
    Shim, Da Eun
    Lim, Sung-Kyu
    Kim, Hyesoon
    2021 27TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2021), 2021, : 908 - 920
  • [10] Energy-Efficient Bayesian Inference Using Near-Memory Computation with Memristors
    Turck, C.
    Harabi, K. -E.
    Hirtzlin, T.
    Vianello, E.
    Laurent, R.
    Droulez, J.
    Bessiere, P.
    Bocquet, M.
    Portal, J. -M.
    Querlioz, D.
    2023 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2023,