Big Data Management Performance Evaluation in Hadoop Ecosystem

被引:2
|
作者
Liu, Qing [1 ]
Fu, Yinjin [1 ]
Ni, Guiqiang [1 ]
Mei, Jianmin [1 ]
机构
[1] PLA Univ Sci & Technol, Coll Command Informat Syst, Nanjing, Jiangsu, Peoples R China
基金
美国国家科学基金会;
关键词
Hadoop platform; big data management; distributed file system; NoSQL database; SQL-like component; performance test; DATABASES;
D O I
10.1109/BIGCOM.2017.26
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the further research of big data management, plenty of components for big data management have been developed. Based on Hadoop platform, these components provide solutions for big data management from different levels. The Hadoop ecosystem has gradually taken its shape. However, users usually lack the knowledge about the features of these components, such as the I/O pattern, capability, application scenes and so on. When dealing with some big data problems, these components are often chosen by user's experience and this will definitely lead to mismatch between the demands and the management tools. Thus, the platform cannot play out its optimal performance. Focus on this issue, this paper tested and evaluated several widely used mainstream big data management tools in Hadoop ecosystem from three levels: distributed file system, NoSQL database and SQL-like component. After the brief introduction to the typical management tools, comprehensive comparisons of these tools of the same level are carried out. The advantages and disadvantages are discussed and their performance are also tested and analyzed.
引用
收藏
页码:413 / 421
页数:9
相关论文
共 50 条
  • [21] Design of a vertical search engine for synchrotron data: a big data approach using Hadoop ecosystem
    Ali Khaleghi
    Kamran Mahmoudi
    Sonia Mozaffari
    SN Applied Sciences, 2019, 1
  • [22] Design of a vertical search engine for synchrotron data: a big data approach using Hadoop ecosystem
    Khaleghi, Ali
    Mahmoudi, Kamran
    Mozaffari, Sonia
    SN APPLIED SCIENCES, 2019, 1 (12):
  • [23] Optimizing Hadoop Performance for Big Data Analytics in Smart Grid
    Khan, Mukhtaj
    Huang, Zhengwen
    Li, Maozhen
    Taylor, Gareth A.
    Ashton, Phillip M.
    Khan, Mushtaq
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2017, 2017
  • [24] An Approach to Enhance the Performance of Hadoop MapReduce Framework for Big Data
    Chandra, Subhash
    Motwani, Deepak
    2016 INTERNATIONAL CONFERENCE ON MICRO-ELECTRONICS AND TELECOMMUNICATION ENGINEERING (ICMETE), 2016, : 178 - 182
  • [25] Big Data Migration and Sentiment Analysis of Real Time Events Using Hadoop Ecosystem
    Chandana, R.
    Harshitha, D.
    Meenakshi
    Ramachandra, A. C.
    INTERNATIONAL CONFERENCE ON INTELLIGENT DATA COMMUNICATION TECHNOLOGIES AND INTERNET OF THINGS, ICICI 2018, 2019, 26 : 764 - 770
  • [26] A Solution to Combat Cyber security Threats Involving Big Data Analytics in the Hadoop Ecosystem
    Lnenicka, Martin
    Capek, Jan
    Komarkova, Jitka
    Machova, Renata
    Cermakova, Ivana
    VISION 2020: SUSTAINABLE ECONOMIC DEVELOPMENT, INNOVATION MANAGEMENT, AND GLOBAL GROWTH, VOLS I-IX, 2017, 2017, : 1804 - 1812
  • [27] Big Data Management Processing with Hadoop MapReduce and Spark Technology: A Comparison
    Verma, Ankush
    Mansuri, Ashik Hussain
    Jain, Neelesh
    2016 SYMPOSIUM ON COLOSSAL DATA ANALYSIS AND NETWORKING (CDAN), 2016,
  • [28] Enhancing Dataset Processing in Hadoop YARN Performance for Big Data Applications
    Al-Absi, Ahmed Abdulhakim
    Kang, Dae-Ki
    Kim, Myong-Jong
    ADVANCED MULTIMEDIA AND UBIQUITOUS ENGINEERING: FUTURE INFORMATION TECHNOLOGY, VOL 2, 2016, 354 : 9 - 15
  • [29] Performance Modeling and Analysis of a Hadoop Cluster for Efficient Big Data Processing
    Lim, JongBeom
    Ahnh, Jong-Suk
    Lee, Kang-Woo
    ADVANCED SCIENCE LETTERS, 2016, 22 (09) : 2314 - 2319
  • [30] A Performance Analysis of MapReduce Applications on Big Data in Cloud based Hadoop
    Gohil, Parth
    Garg, Dweepna
    Panchal, Bakul
    2014 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2014,