R3S: RDMA-based RDD Remote Storage for Spark

被引:1
|
作者
Yan, Xinan [1 ]
Wong, Bernard [1 ]
Choy, Sharon [1 ]
机构
[1] Univ Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Spark; RDMA; Adaptive Storage;
D O I
10.1145/3008167.3008171
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the past few years, Spark has seen rapid adoption by organizations that require high-performance data processing. This growth stems from its significant performance improvement over past frameworks such as Hadoop and MapReduce. Spark achieves its performance improvement by aggressively storing intermediate computation in memory in order to avoid slow disk accesses. However, Spark's performance advantage disappears when Spark nodes do not have enough local memory to store the intermediate computations, requiring results to be written to disk or recomputed. In this paper, we introduce R3S, an RDMA-based in-memory RDD storage layer for Spark. R3S leverages high-bandwidth networks and low-latency one-sided RDMA operations to allow Spark nodes to efficiently access intermediate output from a remote node. R3S can use this flexibility to treat the memory available on the different machines in the cluster as a single memory pool. This enables more efficient use of memory and can reduce job completion time. R3S can also work together with the cluster manager to add storage-only nodes to the Spark cluster. This can benefit workloads where adding memory has a far larger impact on performance than adding processing units. Our prototype reduces job completion time by 29% compared to Spark using Tachyon as the RDD storage layer on a machine learning benchmark.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Survey on RDMA-Based Distributed Storage Systems
    Chen, Youmin
    Lu, Youyou
    Luo, Shengmei
    Shu, Jiwu
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (02): : 227 - 239
  • [2] RDMA-based Direct Transfer of File Data to Remote Page Cache
    Sasaki, Shin
    Takahashi, Kazushi
    Oyama, Yoshihiro
    Tatebe, Osamu
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING - CLUSTER 2015, 2015, : 214 - 225
  • [3] Fast RDMA-based Ordered Key-Value Store using Remote Learned Cache
    Wei, Xingda
    Chen, Rong
    Chen, Haibo
    [J]. PROCEEDINGS OF THE 14TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDI '20), 2020, : 117 - 135
  • [4] Seriema: RDMA-based Remote Invocation with a Case-Study on Monte-Carlo Tree Search
    Mendes, Hammurabi
    Wiedenbeck, Bryce
    O'Neill, Aidan
    [J]. 2022 IEEE 34TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2022), 2022, : 11 - 20
  • [5] XStore: Fast RDMA-Based Ordered Key-Value Store Using Remote Learned Cache
    Wei, Xingda
    Chen, Rong
    Chen, Haibo
    Zang, Binyu
    [J]. ACM TRANSACTIONS ON STORAGE, 2021, 17 (03)
  • [6] 系泊链用圆钢R3S开发实践
    王前
    张小华
    尹修刚
    王刚
    肖波
    韩红艳
    [J]. 现代交通与冶金材料, 2010, 38 (02) : 19 - 20
  • [7] The R3s: A Modern Approach to Predicting Substance Recidivism in Liver Transplant Candidates
    Aljudaibi, Bandar
    Leistman, Samantha C.
    Melaragno, Jennifer
    Dokus, M. Katherine
    Martens, John A.
    Salter, Mary A.
    Hutchinson, David J.
    Nickels, Mark W.
    [J]. HEPATOLOGY, 2018, 68 : 153A - 154A
  • [8] TRIALKYLSULFURIO RADICALS R3S - OCTET FLARE ON SULFUR BY AGGRESSIVE RADICALS - PROBLEM OF IDENTITY REACTIONS
    SCHMIDT, U
    HOCHRAIN.A
    NIKIFORO.A
    [J]. TETRAHEDRON LETTERS, 1970, (42) : 3677 - &
  • [9] R3S级C型梨形卸扣的热处理工艺改进
    张明
    张少宗
    李剑
    邵云亮
    [J]. 热处理技术与装备, 2013, 34 (06) : 20 - 22
  • [10] The [R3S][HgI3] family : contributions of P. C. Ray and some later developments
    Chakravorty, Animesh
    [J]. JOURNAL OF THE INDIAN CHEMICAL SOCIETY, 2013, 90 (12) : 2165 - 2167