A Systematic Literature Review of Big Data and the Hadoop frameworks

被引:0
|
作者
Naidu, Devishree [1 ]
Thakur, Adi [1 ]
机构
[1] Shri Ramdeobaba Coll Engn & Management, Comp Sci & Engn Dept, Nagpur, Maharashtra, India
关键词
Big data; Flume; Map Reduce; Hadoop Ecosystem; Hadoop frameworks;
D O I
10.9756/INT-JECSE/V14I2.287
中图分类号
G76 [特殊教育];
学科分类号
040109 ;
摘要
Big data is a term to define the huge amount of data gathered mostly through new data sources like Twitter, Instagram, Facebook etc. This data is important as its analysis is changing how major businesses work and has the ability to provide the knowledge required to cut back business costs. Most firms are currently using this technology to accurately find trends and predict future events in their various industries. The challenge lies in finding the best way to process, analyze and draw useful insights from this data. This data cannot be handled efficiently by the traditional data management tools and hence required some other advanced data technologies. This is mainly because of its unstructured nature and the five V's - Volume, Variety, Velocity, Value, and Veracity which we mostly use to define big data are the main reason why its handling is a major challenge. Since this data is growing at an exponential rate, it was a necessity a develop technologies to address it. Hadoop, Map Reduce, and No SQL are the major three technologies that were developed to handle the complexities of big data and manage it reliably. This paper discusses the several technologies based on Hadoop which is altogether called the Hadoop ecosystem and their uses in analyzing big data.
引用
收藏
页码:2969 / 2973
页数:5
相关论文
共 50 条
  • [31] Challenges and Issues in Unstructured Big Data: A Systematic Literature Review
    Nafis, Nur Syafiqah Mohd
    Awang, Suryanti
    ADVANCED SCIENCE LETTERS, 2018, 24 (10) : 7716 - 7722
  • [32] Big data and sentiment analysis: A comprehensive and systematic literature review
    Hajiali, Mahdi
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (14):
  • [33] Task Scheduling in Big Data Platforms: A Systematic Literature Review
    Soualhia, Mbarka
    Khomh, Foutse
    Tahar, Sofiene
    JOURNAL OF SYSTEMS AND SOFTWARE, 2017, 134 : 170 - 189
  • [34] Energy-efficient hadoop for big data analytics and computing: A systematic review and research insights
    Wu, WenTai
    Lin, WeiWei
    Hsu, Ching-Hsien
    He, LiGang
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 86 : 1351 - 1367
  • [35] A comprehensive view of Hadoop research - A systematic literature review
    Polato, I. (ipolato@utfpr.edu.br), 1600, Academic Press (46):
  • [36] Small files? problem in Hadoop: A systematic literature review
    Aggarwal, Raveena
    Verma, Jyoti
    Siwach, Manvi
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (10) : 8658 - 8674
  • [37] A comprehensive view of Hadoop research - A systematic literature review
    Polato, Ivanilton
    Ré, Reginaldo
    Goldman, Alfredo
    Kon, Fabio
    Journal of Network and Computer Applications, 2014, 46 : 1 - 25
  • [38] Quality Assurance Technologies of Big Data Applications: A Systematic Literature Review
    Ji, Shunhui
    Li, Qingqiu
    Cao, Wennan
    Zhang, Pengcheng
    Muccini, Henry
    APPLIED SCIENCES-BASEL, 2020, 10 (22): : 1 - 31
  • [39] Systematic Review of the Literature on Big Data in the Transportation Domain: Concepts and Applications
    Neilson, Alex
    Indratmo
    Daniel, Ben
    Tjandra, Stevanus
    BIG DATA RESEARCH, 2019, 17 : 35 - 44
  • [40] Big data analytics capabilities: a systematic literature review and research agenda
    Patrick Mikalef
    Ilias O. Pappas
    John Krogstie
    Michail Giannakos
    Information Systems and e-Business Management, 2018, 16 : 547 - 578