Big Data Management Performance Evaluation in Hadoop Ecosystem

被引:2
|
作者
Liu, Qing [1 ]
Fu, Yinjin [1 ]
Ni, Guiqiang [1 ]
Mei, Jianmin [1 ]
机构
[1] PLA Univ Sci & Technol, Coll Command Informat Syst, Nanjing, Jiangsu, Peoples R China
基金
美国国家科学基金会;
关键词
Hadoop platform; big data management; distributed file system; NoSQL database; SQL-like component; performance test; DATABASES;
D O I
10.1109/BIGCOM.2017.26
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the further research of big data management, plenty of components for big data management have been developed. Based on Hadoop platform, these components provide solutions for big data management from different levels. The Hadoop ecosystem has gradually taken its shape. However, users usually lack the knowledge about the features of these components, such as the I/O pattern, capability, application scenes and so on. When dealing with some big data problems, these components are often chosen by user's experience and this will definitely lead to mismatch between the demands and the management tools. Thus, the platform cannot play out its optimal performance. Focus on this issue, this paper tested and evaluated several widely used mainstream big data management tools in Hadoop ecosystem from three levels: distributed file system, NoSQL database and SQL-like component. After the brief introduction to the typical management tools, comprehensive comparisons of these tools of the same level are carried out. The advantages and disadvantages are discussed and their performance are also tested and analyzed.
引用
收藏
页码:413 / 421
页数:9
相关论文
共 50 条
  • [1] EverAnalyzer: A Self-Adjustable Big Data Management Platform Exploiting the Hadoop Ecosystem
    Karamolegkos, Panagiotis
    Mavrogiorgou, Argyro
    Kiourtis, Athanasios
    Kyriazis, Dimosthenis
    INFORMATION, 2023, 14 (02)
  • [2] Hadoop and Spark for Data Management, Processing and Analysis of Astronomical Big Data: Applicability and Performance
    Harischandra, Lloyd
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XXV, 2017, 512 : 41 - 44
  • [3] Anomaly Detection for Big Log Data Using a Hadoop Ecosystem
    Son, Siwoon
    Gil, Myeong-Seon
    Moon, Yang-Sae
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2017, : 377 - 380
  • [4] Performance Evaluation Of Association Mining In Hadoop Single Node Cluster With Big Data
    Asbern, A.
    Asha, P.
    2015 INTERNATIONAL CONFERENCED ON CIRCUITS, POWER AND COMPUTING TECHNOLOGIES (ICCPCT-2015), 2015,
  • [5] Performance Evaluation of HDFS in Big Data Management
    Dev, Dipayan
    Patgiri, Ripon
    2014 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND APPLICATIONS (ICHPCA), 2014,
  • [6] A Novel Clustering Technique for Efficient Clustering of Big Data in Hadoop Ecosystem
    Sunil Kumar
    Maninder Singh
    Big Data Mining and Analytics, 2019, (04) : 240 - 247
  • [7] IoT Big Data provenance scheme using blockchain on Hadoop ecosystem
    Houshyar Honar Pajooh
    Mohammed A. Rashid
    Fakhrul Alam
    Serge Demidenko
    Journal of Big Data, 8
  • [8] IoT Big Data provenance scheme using blockchain on Hadoop ecosystem
    Pajooh, Houshyar Honar
    Rashid, Mohammed A.
    Alam, Fakhrul
    Demidenko, Serge
    JOURNAL OF BIG DATA, 2021, 8 (01)
  • [9] A Novel Clustering Technique for Efficient Clustering of Big Data in Hadoop Ecosystem
    Kumar, Sunil
    Singh, Maninder
    BIG DATA MINING AND ANALYTICS, 2019, 2 (04): : 240 - 247
  • [10] A Literature Review on Hadoop Ecosystem and Various Techniques of Big Data Optimization
    Singh, Vikash Kumar
    Taram, Manish
    Agrawal, Vinni
    Baghel, Bhartee Singh
    ADVANCES IN DATA AND INFORMATION SCIENCES, VOL 1, 2018, 38 : 231 - 240