MapReduce Solutions Classification by Their Implementation

Cited by: 0
Authors
Orynbekova, Kamila [1]
Bogdanchikov, Andrey [1]
Cankurt, Selcuk [2]
Adamov, Abzatdin [3]
Kadyrov, Shirali [1]
Affiliations
[1] Suleyman Demirel Univ, Alma Ata, Kazakhstan
[2] Vistula Univ, Warsaw, Poland
[3] ADA Univ, Baku, Azerbaijan
Source
Keywords
MapReduce; big data; Apache Hadoop; Apache Spark; problems classification; solutions categorization; course design;
DOI
10.3991/ijep.v13i5.38867
Chinese Library Classification number
G40 [Education];
Discipline classification code
040101; 120403;
Abstract
Distributed systems are widely used in industrial projects and scientific research. The Apache Hadoop environment, which is built on the MapReduce paradigm, has lost popularity as newer tools have emerged. For example, Apache Spark is preferred in some cases because it holds intermediate calculations in RAM; it therefore runs faster and is easier to use. To take full advantage of it, users must think in terms of the MapReduce concept. In this paper, a conventional solution and a MapReduce solution to each of ten problems were compared through their pseudocode and categorized into five groups. From the descriptions and pseudocode of these groups, readers can grasp the MapReduce concept without taking specific courses. This paper proposes a five-category classification methodology to help distributed-system users learn the MapReduce paradigm quickly. The proposed methodology is illustrated with ten tasks. Furthermore, a statistical analysis is carried out to test whether the proposed classification methodology affects learner performance. The results indicate that the proposed model outperforms the traditional approach with statistical significance (p-value below 0.05). The policy implication is that educational institutions and organizations could adopt the proposed classification methodology to help learners and employees acquire the knowledge and skills needed to use distributed systems effectively.
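To illustrate the kind of comparison the abstract describes, the sketch below places a conventional solution next to a MapReduce-style solution for one task. Word counting is an assumption chosen for illustration; the record does not say which ten problems the paper uses. The function names (`count_words_usual`, `map_phase`, `shuffle_phase`, `reduce_phase`) are hypothetical, and the code runs on a single machine in plain Python rather than on Hadoop or Spark.

```python
from collections import Counter, defaultdict


def count_words_usual(lines):
    """Conventional solution: one loop updating shared state."""
    counts = Counter()
    for line in lines:
        counts.update(line.lower().split())
    return dict(counts)


def map_phase(line):
    """Map: emit an independent (key, value) pair for every word."""
    return [(word, 1) for word in line.lower().split()]


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine the values of one key into a single result."""
    return key, sum(values)


def count_words_mapreduce(lines):
    """MapReduce-style solution composed of the three phases above."""
    mapped = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    sample = ["big data needs big tools", "spark and hadoop process big data"]
    # Both approaches should agree on the same input.
    assert count_words_usual(sample) == count_words_mapreduce(sample)
    print(count_words_mapreduce(sample))
```

The conventional version relies on one mutable counter, whereas the MapReduce version has no state shared between map calls, which is what would allow a framework to run them on separate workers.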
Pages: 58-71
Page count: 14