A Systematic Literature Review of Big Data and the Hadoop frameworks

被引:0
|
作者
Naidu, Devishree [1 ]
Thakur, Adi [1 ]
机构
[1] Shri Ramdeobaba Coll Engn & Management, Comp Sci & Engn Dept, Nagpur, Maharashtra, India
关键词
Big data; Flume; Map Reduce; Hadoop Ecosystem; Hadoop frameworks;
D O I
10.9756/INT-JECSE/V14I2.287
中图分类号
G76 [特殊教育];
学科分类号
040109 ;
摘要
Big data is a term to define the huge amount of data gathered mostly through new data sources like Twitter, Instagram, Facebook etc. This data is important as its analysis is changing how major businesses work and has the ability to provide the knowledge required to cut back business costs. Most firms are currently using this technology to accurately find trends and predict future events in their various industries. The challenge lies in finding the best way to process, analyze and draw useful insights from this data. This data cannot be handled efficiently by the traditional data management tools and hence required some other advanced data technologies. This is mainly because of its unstructured nature and the five V's - Volume, Variety, Velocity, Value, and Veracity which we mostly use to define big data are the main reason why its handling is a major challenge. Since this data is growing at an exponential rate, it was a necessity a develop technologies to address it. Hadoop, Map Reduce, and No SQL are the major three technologies that were developed to handle the complexities of big data and manage it reliably. This paper discusses the several technologies based on Hadoop which is altogether called the Hadoop ecosystem and their uses in analyzing big data.
引用
收藏
页码:2969 / 2973
页数:5
相关论文
共 50 条
  • [1] A Literature Review on Hadoop Ecosystem and Various Techniques of Big Data Optimization
    Singh, Vikash Kumar
    Taram, Manish
    Agrawal, Vinni
    Baghel, Bhartee Singh
    [J]. ADVANCES IN DATA AND INFORMATION SCIENCES, VOL 1, 2018, 38 : 231 - 240
  • [2] A Systematic Review On Cardiovascular diseases using Big-Data by Hadoop
    Thakur, Sanjeev
    Ramzan, Munaza
    [J]. 2016 6TH INTERNATIONAL CONFERENCE - CLOUD SYSTEM AND BIG DATA ENGINEERING (CONFLUENCE), 2016, : 351 - 355
  • [3] A Review on Big Data and Hadoop Security
    Khaloufi, Hayat
    Beni-Hssane, Abderrahim
    Abouelmehdi, Karim
    Saadi, Mostafa
    [J]. Networked Systems, NETYS 2016, 2016, 9944 : 386 - 386
  • [4] A Systematic Review of Distributed Deep Learning Frameworks for Big Data
    Berloco, Francesco
    Bevilacqua, Vitoantonio
    Colucci, Simona
    [J]. INTELLIGENT COMPUTING METHODOLOGIES, PT III, 2022, 13395 : 242 - 256
  • [5] BIG DATA ARCHITECTURES FOR DATA LAKES: A SYSTEMATIC LITERATURE REVIEW
    Ramchand, Sonam
    Mahmood, Tariq
    [J]. 2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 1141 - 1146
  • [6] Big data analytics in healthcare: a systematic literature review
    Khanra, Sayantan
    Dhir, Amandeep
    Islam, Najmul
    Mantymaki, Matti
    [J]. ENTERPRISE INFORMATION SYSTEMS, 2020, 14 (07) : 878 - 912
  • [7] Cleaning Big Data Streams: A Systematic Literature Review
    Alotaibi, Obaid
    Pardede, Eric
    Tomy, Sarath
    Bagui, Sikha
    Iacono, Mauro
    [J]. TECHNOLOGIES, 2023, 11 (04)
  • [8] 15 years of Big Data: a systematic literature review
    Tosi, Davide
    Kokaj, Redon
    Roccetti, Marco
    [J]. JOURNAL OF BIG DATA, 2024, 11 (01)
  • [9] Security and Privacy for Big Data: A Systematic Literature Review
    Nelson, Boel
    Olovsson, Tomas
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 3693 - 3702
  • [10] Manufacturing big data ecosystem: A systematic literature review
    Cui, Yesheng
    Kara, Sami
    Chan, Ka C.
    [J]. ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2020, 62