Optimization Techniques for a Distributed In-Memory Computing Platform by Leveraging SSD

被引:1
|
作者
Choi, June [1 ]
Lee, Jaehyun [1 ]
Kim, Jik-Soo [2 ]
Lee, Jaehwan [1 ]
机构
[1] Korea Aerosp Univ, Sch Elect & Informat Engn, Goyang Si 10540, South Korea
[2] Myongji Univ, Dept Comp Engn, Yongin 03674, South Korea
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 18期
基金
新加坡国家研究基金会;
关键词
Apache Spark; memory management; solid-state drive; in-memory processing framework; performance; PageRank; transitive closure; TeraSort; k-means clustering; !text type='Java']Java[!/text] Virtual Machine heap configuration; resilient distributed dataset; MAPREDUCE;
D O I
10.3390/app11188476
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In this paper, we present several optimization strategies that can improve the overall performance of the distributed in-memory computing system, "Apache Spark". Despite its distributed memory management capability for iterative jobs and intermediate data, Spark has a significant performance degradation problem when the available amount of main memory (DRAM, typically used for data caching) is limited. To address this problem, we leverage an SSD (solid-state drive) to supplement the lack of main memory bandwidth. Specifically, we present an effective optimization methodology for Apache Spark by collectively investigating the effects of changing the capacity fraction ratios of the shuffle and storage spaces in the "Spark JVM Heap Configuration" and applying different "RDD Caching Policies" (e.g., SSD-backed memory caching). Our extensive experimental results show that by utilizing the proposed optimization techniques, we can improve the overall performance by up to 42%.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] In-memory computing on a photonic platform
    Rios, Carlos
    Youngblood, Nathan
    Cheng, Zengguang
    Le Gallo, Manuel
    Pernice, Wolfram H. P.
    Wright, C. David
    Sebastian, Abu
    Bhaskaran, Harish
    [J]. SCIENCE ADVANCES, 2019, 5 (02):
  • [2] In-Memory Computing Architectures for Sparse Distributed Memory
    Kang, Mingu
    Shanbhag, Naresh R.
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2016, 10 (04) : 855 - 863
  • [3] Distributed In-Memory Computing on Binary RRAM Crossbar
    Ni, Leibin
    Huang, Hantao
    Liu, Zichuan
    Joshi, Rajiv V.
    Yu, Hao
    [J]. ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2017, 13 (03)
  • [4] Leveraging Ferroelectric Stochasticity and In-Memory Computing for DNN IP Obfuscation
    Mankali, Likhitha
    Rangarajan, Nikhil
    Chatterjee, Swetaki
    Kumar, Shubham
    Chauhan, Yogesh Singh
    Sinanoglu, Ozgur
    Amrouch, Hussam
    [J]. IEEE JOURNAL ON EXPLORATORY SOLID-STATE COMPUTATIONAL DEVICES AND CIRCUITS, 2022, 8 (02): : 102 - 110
  • [5] In-memory Distributed Matrix Computation Processing and Optimization
    Yu, Yongyang
    Tang, Mingjie
    Aref, Walid G.
    Malluhi, Qutaibah M.
    Abbas, Mostafa M.
    Ouzzani, Mourad
    [J]. 2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 1047 - 1058
  • [6] Leveraging Coding Techniques for Speeding up Distributed Computing
    Konstantinidis, Konstantinos
    Ramamoorthy, Aditya
    [J]. 2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
  • [7] Configuration through Optimization for In-Memory Computing Hardware and Simulators
    Li, Ke-Han
    Hsu, Chih-Fan
    Lin, Yu-Sheng
    Chien, Shao-Yi
    Chen, Wei-Chao
    [J]. 2022 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2022, : 79 - 84
  • [8] IMFSSC: An In-Memory Distributed File System Framework for Super Computing
    Li, Binyang
    Li, Bo
    Liu, Ming
    [J]. 2016 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CCBD), 2016, : 132 - 137
  • [9] Accelerating Event Processing for Security Analytics on a Distributed In-Memory Platform
    Jaeger, David
    Cheng, Feng
    Meinel, Christoph
    [J]. 2018 16TH IEEE INT CONF ON DEPENDABLE, AUTONOM AND SECURE COMP, 16TH IEEE INT CONF ON PERVAS INTELLIGENCE AND COMP, 4TH IEEE INT CONF ON BIG DATA INTELLIGENCE AND COMP, 3RD IEEE CYBER SCI AND TECHNOL CONGRESS (DASC/PICOM/DATACOM/CYBERSCITECH), 2018, : 634 - 643
  • [10] Optimization of Projected Phase Change Memory for Analog In-Memory Computing Inference
    Li, Ning
    Mackin, Charles
    Chen, An
    Brew, Kevin
    Philip, Timothy
    Simon, Andrew
    Saraf, Iqbal
    Han, Jin-Ping
    Sarwat, Syed Ghazi
    Burr, Geoffrey W.
    Rasch, Malte
    Sebastian, Abu
    Narayanan, Vijay
    Saulnier, Nicole
    [J]. ADVANCED ELECTRONIC MATERIALS, 2023, 9 (06)