Apache Spark and Apache Ignite Performance Analysis

被引:6
|
作者
Stan, Cristiana-Stefania [1 ]
Pandelica, Adrian-Eduard [1 ]
Zamfir, Vlad-Andrei [1 ]
Stan, Roxana Gabriela [1 ]
Negru, Catalin [1 ]
机构
[1] Univ Politehn Bucuresti, Dept Comp Sci, Bucharest, Romania
关键词
Big Data; Apache Spark; Ignite; performance evaluation;
D O I
10.1109/CSCS.2019.00129
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Big Data represents an actual research topic. More and more it becomes part of people life's through different applications that are used daily, such as stock exchange, news, social media, health-care. All these applications make use of Big Data technologies for storing and processing information. There have been developed numerous technologies for implementing Big Data requirements and it is interesting to follow their strengths and weaknesses, when to use one over another and how well they perform in different situations. In this paper, we compare two frameworks Apache Spark and Ignite that are used for data processing. We perform the comparison taking into consideration the following aspects: features, implementation, architecture, and performance metrics. In order to test the performance, we used two popular applications such as word count and k-means clustering. Results show that Spark achieved better performance than Ignite.
引用
收藏
页码:726 / 733
页数:8
相关论文
共 50 条
  • [1] Performance Comparison of Apache Hadoop and Apache Spark
    Singh, Amritpal
    Khamparia, Aditya
    Luhach, Ashish Kr
    [J]. PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS FOR COMPUTING RESEARCH (ICAICR '19), 2019,
  • [2] Performance Evaluation of Query Plan Recommendation with Apache Hadoop and Apache Spark
    Azhir, Elham
    Hosseinzadeh, Mehdi
    Khan, Faheem
    Mosavi, Amir
    [J]. MATHEMATICS, 2022, 10 (19)
  • [3] Performance evaluation of cloud-based log file analysis with Apache Hadoop and Apache Spark
    Mavridis, Ilias
    Karatza, Helen
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2017, 125 : 133 - 151
  • [4] Performance Prediction for Apache Spark Platform
    Wang, Kewen
    Khan, Mohammad Maifi Hasan
    [J]. 2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 166 - 173
  • [5] Efficient Performance Prediction for Apache Spark
    Cheng, Guoli
    Ying, Shi
    Wang, Bingming
    Li, Yuhang
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2021, 149 : 40 - 51
  • [6] Analysis of consistency for In Memory Data Grid Apache Ignite
    Tapekhin, Andrey
    Bogomolov, Igor
    Velikanov, Oleg
    [J]. 2019 IVANNIKOV MEMORIAL WORKSHOP (IVMEM 2019), 2019, : 46 - 50
  • [7] A comprehensive performance analysis of Apache Hadoop and Apache Spark for large scale data sets using HiBench
    N. Ahmed
    Andre L. C. Barczak
    Teo Susnjak
    Mohammed A. Rashid
    [J]. Journal of Big Data, 7
  • [8] A comprehensive performance analysis of Apache Hadoop and Apache Spark for large scale data sets using HiBench
    Ahmed, N.
    Barczak, Andre L. C.
    Susnjak, Teo
    Rashid, Mohammed A.
    [J]. JOURNAL OF BIG DATA, 2020, 7 (01)
  • [9] Statistical Analysis of the Performance of Four Apache Spark ML Algorithms
    Camele, Genaro
    Hasperue, Waldo
    Ronchetti, Franco
    Quiroga, Facundo Manuel
    [J]. JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY, 2022, 22 (02): : 175 - 182
  • [10] Spectral Graph Analysis with Apache Spark
    Sutic, Davor
    Varga, Ervin
    [J]. ICOMS 2018: 2018 INTERNATIONAL CONFERENCE ON MATHEMATICS AND STATISTICS, 2018, : 84 - 88