Parallel Implementation of Classification Algorithms Based on MapReduce

被引:0
|
作者
He, Qing [1 ]
Zhuang, Fuzhen [1 ]
Li, Jincheng [1 ]
Shi, Zhongzhi [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
来源
关键词
Data Mining; Classification; Parallel Implementation; Large Dataset; MapReduce;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data mining has attracted extensive research for several decades. As an important task of data mining, classification plays an important role in information retrieval, web searching, CRM, etc. Most of the present classification techniques are serial, which become impractical for large dataset. The computing resource is under-utilized and the executing time is not waitable. Provided the program mode of MapReduce, we propose the parallel implementation methods of several classification algorithms, such as k-nearest neighbors, naive bayesian model and decision tree, etc. Preparatory experiments show that the proposed parallel methods can not only process large dataset, but also can be extended to execute on a cluster, which can significantly improve the efficiency.
引用
收藏
页码:655 / 662
页数:8
相关论文
共 50 条
  • [31] Parallel Attribute Reduction Based on MapReduce
    Xi, Dachao
    Wang, Guoyin
    Zhang, Xuerui
    Zhang, Fan
    [J]. ROUGH SETS AND KNOWLEDGE TECHNOLOGY, RSKT 2014, 2014, 8818 : 631 - 641
  • [32] Parallel Clustering Validation Based on MapReduce
    Zerabi, Soumeya
    Meshoul, Souham
    Khantoul, Bilel
    [J]. ADVANCES IN COMPUTING SYSTEMS AND APPLICATIONS, 2019, 50 : 291 - 299
  • [33] Simple Implementation of Parallel Genetic Algorithms Based on Cloud Computing
    Zhao, Jianfeng
    Zeng, Wenghua
    [J]. INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2012, 15 (11A): : 4367 - 4372
  • [34] Parallel Implementation of Chi2 Algorithm in MapReduce Framework
    Zhang, Yong
    Yu, Jingwen
    Wang, Jianying
    [J]. HUMAN CENTERED COMPUTING, HCC 2014, 2015, 8944 : 890 - 899
  • [35] ABDF Integratable Machine Learning Algorithms-MapReduce Implementation
    Sreeveni, Unmesha U. B.
    Sathyadevan, Shiju
    [J]. SECOND INTERNATIONAL SYMPOSIUM ON COMPUTER VISION AND THE INTERNET (VISIONNET'15), 2015, 58 : 297 - 306
  • [36] A MapReduce Cortical Algorithms Implementation for Unsupervised Learning of Big Data
    Hajj, Nadine
    Rizk, Yara
    Awad, Mariette
    [J]. INNS CONFERENCE ON BIG DATA 2015 PROGRAM, 2015, 53 : 327 - 334
  • [37] A Parallel Bit-map based Framework for Classification Algorithms
    De Silva, Amila
    Perera, Shehan
    [J]. PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, TECHNOLOGY AND APPLICATIONS (DATA), 2019, : 259 - 266
  • [38] MapReduce Implementation for Minimum Reduct Using Parallel Genetic Algorithm
    Alshammari, Mashaan A.
    El-Alfy, El-Sayed M.
    [J]. 2015 6TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2015, : 13 - 18
  • [40] Exploration and Implementation of Classification Algorithms for Patent Classification
    Naik, Darshana A.
    Seema, S.
    Singh, Geetika
    Singh, Abhinav
    [J]. COMPUTING AND NETWORK SUSTAINABILITY, 2019, 75