Parallel Implementation of Classification Algorithms Based on MapReduce

Cited by: 0
Authors
He, Qing [1]
Zhuang, Fuzhen [1]
Li, Jincheng [1]
Shi, Zhongzhi [1]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
Source
Keywords
Data Mining; Classification; Parallel Implementation; Large Dataset; MapReduce
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Data mining has attracted extensive research for several decades. Classification, an important data mining task, plays a key role in information retrieval, web search, customer relationship management (CRM), and other applications. Most existing classification techniques are serial and become impractical for large datasets: computing resources are under-utilized and execution times are unacceptably long. Building on the MapReduce programming model, we propose parallel implementations of several classification algorithms, including k-nearest neighbors, the naive Bayesian model, and decision trees. Preliminary experiments show that the proposed parallel methods can not only process large datasets but can also scale out to run on a cluster, which significantly improves efficiency.
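The abstract does not describe how the algorithms are decomposed, so the following is only an illustrative sketch of how naive Bayes training could be phrased as a map step and a reduce step. It assumes categorical features and simulates both phases in a single Python process. The names `map_record`, `reduce_counts`, `train`, and `predict` are hypothetical and are not taken from the paper.

```python
from collections import defaultdict
from itertools import chain


def map_record(record):
    """Map phase: emit (key, 1) pairs for one labeled record."""
    features, label = record
    yield ("class", label), 1
    for index, value in enumerate(features):
        yield ("feature", label, index, value), 1


def reduce_counts(pairs):
    """Reduce phase: sum the values that share a key."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return dict(totals)


def train(splits):
    """Apply the map step to every record of every split, then reduce once."""
    mapped = chain.from_iterable(
        map_record(record) for split in splits for record in split
    )
    return reduce_counts(mapped)


def predict(counts, features, labels, n_values=2):
    """Return the label with the largest Laplace-smoothed score.

    n_values is the assumed number of distinct values per feature.
    """
    total = sum(counts.get(("class", label), 0) for label in labels)
    best_label, best_score = None, -1.0
    for label in labels:
        class_count = counts.get(("class", label), 0)
        score = (class_count + 1) / (total + len(labels))
        for index, value in enumerate(features):
            hits = counts.get(("feature", label, index, value), 0)
            score *= (hits + 1) / (class_count + n_values)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Two input splits stand in for data blocks handled by separate mappers.
splits = [
    [((1, 0), "a"), ((1, 1), "a")],
    [((0, 0), "b")],
]
counts = train(splits)
```

Because the reduce step only sums counts per key, the splits can be mapped independently and in any order, which is the property that would let such a computation be distributed across a cluster.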
Pages: 655-662
Page count: 8