An Adaptive Tuning Strategy on Spark Based on In-memory Computation Characteristics

被引:0
|
作者
Zhao, Yao [1 ]
Hu, Fei [1 ]
Chen, Haopeng [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, 800 Dongchuan Rd, Shanghai, Peoples R China
关键词
Spark; adaptive tuning; in-memory computation; serialization;
D O I
暂无
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
We present an adaptive tuning method to improve Spark performance, especially for its in-memory computation. This manner serves one purpose: making a better use of memory reasonably through adaptively adopting suitable category based on Spark application runtime statistics on different working sets. This solution works in two steps. Firstly, it collects run-time statistics dynamically and stores them in round-robin structures to save memory. Secondly, it can change system storage category based on these statistics. Additionally we focus on serialization strategy optimization. For this purpose we test Spark integrated serialization algorithms: Java and Kryo serialization algorithms, and make a comparison of their performance. In order to gain flexibility we change Spark serialization mechanism by setting the default serialization unit from one RDD to one block. In this way, for the case which RDD has huge amount of blocks our solution can use different serialization algorithms to serialize different blocks in one RDD. We show that our solution is expressive enough to obtain 2x speedup than original Spark when there is inadequate memory resource.
引用
收藏
页码:484 / 488
页数:5
相关论文
共 50 条
  • [1] GPU in-memory processing using Spark for iterative computation
    Hong, Sumin
    Choi, Woohyuk
    Jeong, Won-Ki
    [J]. 2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 31 - 41
  • [2] Performance Analysis and Auto-tuning for SPARK in-memory analytics
    Nikitopoulou, Dimitra
    Masouros, Dimosthenis
    Xydist, Sotirios
    Soudris, Dimitrios
    [J]. PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, : 76 - 81
  • [3] In-Memory Computation With Improved Linearity Using Adaptive Sparsity-Based Compact Thermometric Code
    Saragada, Prasanna Kumar
    Das, Bishnu Prasad
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2022, 30 (10) : 1473 - 1483
  • [4] Trends and Opportunities for SRAM Based In-Memory and Near-Memory Computation
    Srinivasa, Srivatsa
    Ramanathan, Akshay Krishna
    Sundaram, Jainaveen
    Kurian, Dileep
    Gopal, Srinivasan
    Jain, Nilesh
    Srinivasan, Anuradha
    Iyer, Ravi
    Narayanan, Vijaykrishnan
    Karnik, Tanay
    [J]. PROCEEDINGS OF THE 2021 TWENTY SECOND INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2021), 2021, : 547 - 552
  • [5] On the Implications of Heterogeneous Memory Tiering on Spark In-Memory Analytics
    Katsaragakis, Manolis
    Masouros, Dimosthenis
    Papadopoulos, Lazaros
    Catthoor, Francky
    Soudris, Dimitrios
    [J]. 2023 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, IPDPSW, 2023, : 945 - 952
  • [6] ReMeCo: Reliable memristor-based in-memory neuromorphic computation
    BanaGozar, Ali
    Shadmehri, Seyed Hossein Hashemi
    Stuijk, Sander
    Kamal, Mehdi
    Afzali-Kusha, Ali
    Corporaal, Henk
    [J]. 2023 28TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC, 2023, : 396 - 401
  • [7] ReRAM-based In-Memory Computation of Galois Field arithmetic
    Mandal, Swagata
    Bhattacharjee, Debjyoti
    Tavva, Yaswanth
    Chattopadhyay, Anupam
    [J]. PROCEEDINGS OF THE 2018 26TH IFIP/IEEE INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION (VLSI-SOC), 2018, : 1 - 6
  • [8] Adaptive memory reservation strategy for heavy workloads in the Spark environment
    Li, Bohan
    He, Xin
    Yu, Junyang
    Wang, Guanghui
    Song, Yixin
    Pan, Shunjie
    Gu, Hangyu
    [J]. PeerJ Computer Science, 2024, 10
  • [9] An In-memory Database Implementation Technique based on Separation of Management, Computation and Storage
    Zhang, Yan-Song
    Han, Rui-Chen
    Liu, Zhuan
    Zhang, Yu
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2023, 46 (04): : 761 - 779
  • [10] In-Memory Computation Based Mapping of Keccak-f Hash Function
    Kingra, Sandeep Kaur
    Parmar, Vivek
    Suri, Manan
    [J]. FRONTIERS IN NANOTECHNOLOGY, 2022, 4