MapReduce Algorithms for Big Data Analysis

Cited by: 0

Authors:

Shim, Kyuseok [1]

Affiliations:

[1] Seoul Natl Univ, Elect & Comp Engn Dept, Seoul, South Korea

Source:

DATABASES THEORY AND APPLICATIONS, ADC 2018 | 2018 / Vol. 10837

Keywords:

DOI:

Not available

Chinese Library Classification:

TP301 [Theory, Methods];

Discipline classification code:

081202 ;

Abstract:

A growing number of applications must handle big data, yet analyzing big data remains very challenging. For such applications, the MapReduce framework has attracted a lot of attention. MapReduce is a programming model that allows easy development of scalable parallel applications to process big data on large clusters of commodity machines. Google's MapReduce, or its open-source equivalent Hadoop, is a powerful tool for building such applications. In this tutorial, I will first introduce the MapReduce framework based on the Hadoop system, which is available to everyone for running distributed computing algorithms using MapReduce. I will next discuss how to design efficient MapReduce algorithms and present the state of the art in MapReduce algorithms for big data analysis. Since Spark was recently developed to overcome the shortcomings of MapReduce, which is not optimized for iterative algorithms and interactive data analysis, I will also present an outline of Spark as well as the differences between MapReduce and Spark. The intended audience of this tutorial is professionals who plan to develop efficient MapReduce algorithms and researchers who should be aware of the state of the art in MapReduce algorithms available today for big data analysis.
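
The programming model named in the abstract can be illustrated with a minimal, single-process sketch of word counting. This is not taken from the tutorial and does not use Hadoop's API; the names `map_word_count`, `reduce_word_count`, and `run_mapreduce` are hypothetical, and the in-memory grouping only stands in for the shuffle step that a real cluster would perform across machines.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, Iterator, List, Tuple


def map_word_count(line: str) -> Iterator[Tuple[str, int]]:
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def reduce_word_count(word: str, counts: List[int]) -> Tuple[str, int]:
    """Reduce step: sum all counts that were grouped under one word."""
    return word, sum(counts)


def run_mapreduce(
    records: Iterable[str],
    mapper: Callable[[str], Iterator[Tuple[str, int]]],
    reducer: Callable[[str, List[int]], Tuple[str, int]],
) -> Dict[str, int]:
    """Hypothetical local driver: map, group by key (shuffle), then reduce."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)  # stands in for the distributed shuffle
    return dict(reducer(key, values) for key, values in sorted(groups.items()))


if __name__ == "__main__":
    lines = ["big data on large clusters", "big data analysis"]
    print(run_mapreduce(lines, map_word_count, reduce_word_count))
```

In a framework such as Hadoop, the user would supply only the map and reduce functions, while the framework handles partitioning, shuffling, and fault tolerance; the driver here exists only to make the sketch runnable on one machine.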


Pages: XV / XV

Page count: 1
