MapReduce Algorithms for Big Data Analysis

被引:0
|
作者
Shim, Kyuseok [1 ]
机构
[1] Seoul Natl Univ, Elect & Comp Engn Dept, Seoul, South Korea
来源
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
There is a growing trend of applications that should handle big data. However, analyzing big data is very challenging today. For such applications, the MapReduce framework has recently attracted a lot of attention. MapReduce is a programming model that allows easy development of scalable parallel applications to process big data on large clusters of commodity machines. Google's MapReduce or its open-source equivalent Hadoop is a powerful tool for building such applications. In this tutorial, I will first introduce the MapReduce framework based on Hadoop system available to everyone to run distributed computing algorithms using MapReduce. I will next discuss how to design efficient MapReduce algorithms and present the state-of-the-art in MapReduce algorithms for big data analysis. Since Spark is recently developed to overcome the shortcomings of MapReduce which is not optimized for of iterative algorithms and interactive data analysis, I will also present an outline of Spark as well as the differences between MapReduce and Spark. The intended audience of this tutorial is professionals who plan to develop efficient MapReduce algorithms and researchers who should be aware of the state-of-the-art in MapReduce algorithms available today for big data analysis.
引用
收藏
页码:XV / XV
页数:1
相关论文
共 50 条
  • [31] Cross-Cloud MapReduce for Big Data
    Li, Peng
    Guo, Song
    Yu, Shui
    Zhuang, Weihua
    [J]. IEEE TRANSACTIONS ON CLOUD COMPUTING, 2020, 8 (02) : 375 - 386
  • [32] Evolving Big Data Stream Classification with MapReduce
    Haque, Ahsanul
    Parker, Brandon
    Khan, Latifur
    Thuraisingham, Bhavani
    [J]. 2014 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2014, : 570 - 577
  • [33] A Mapreduce Fuzzy Techniques of Big Data Classification
    El Bakry, Malak
    Safwat, Soha
    Hegazy, Osman
    [J]. PROCEEDINGS OF THE 2016 SAI COMPUTING CONFERENCE (SAI), 2016, : 118 - 128
  • [34] Design of MapReduce and CTA for Big Data System
    Kim, Earl
    Shin, Dong-ryeol
    [J]. 2015 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ARTIFICIAL INTELLIGENCE (CAAI 2015), 2015, : 294 - 297
  • [35] Clustering on Big Data Using Hadoop MapReduce
    Akthar, Nadeem
    Ahamad, Mohd Vasim
    Khan, Shahbaz
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 789 - 795
  • [36] Efficient Big Data Processing in Hadoop MapReduce
    Dittrich, Jens
    Quiane-Ruiz, Jorge-Arnulfo
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (12): : 2014 - 2015
  • [37] Correlated Topic Modeling for Big Data with MapReduce
    Oo, Mi Khine
    Khine, May Aye
    [J]. 2018 IEEE 7TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE 2018), 2018, : 408 - 409
  • [38] Big Data Analytics based on PANFIS MapReduce
    Za'in, Choiru
    Pratama, Mahardhika
    Lughofer, Edwin
    Ferdaus, Meftahul
    Cai, Qing
    Prasad, Mukesh
    [J]. INNS CONFERENCE ON BIG DATA AND DEEP LEARNING, 2018, 144 : 140 - 152
  • [39] Standardizing Unstructured Big Data and Visual Interpretation using MapReduce and Correspondence Analysis
    Choi, Joseph
    Choi, Yong-Seok
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2014, 27 (02) : 169 - 183
  • [40] Modeling and Analysis of Hadoop MapReduce Systems for Big Data Using Petri Nets
    Chiang, Dai-Lun
    Wang, Sheng-Kuan
    Wang, Yu-Ying
    Lin, Yi-Nan
    Hsieh, Tsang-Yen
    Yang, Cheng-Ying
    Shen, Victor R. L.
    Ho, Hung-Wei
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2021, 35 (01) : 80 - 104