Research on the application of cloud computing in data mining algorithm

被引:0
|
作者
Fang, Jia-Juan [1 ]
Li, Xiao-Ling [2 ]
机构
[1] Zhengzhou Tech Coll, Software Engn Dept, Zhengzhou 450121, Henan, Peoples R China
[2] Zhongyuan Univ Technol, Coll Informat & Business, Dept Informat Technol, Zhengzhou 450007, Peoples R China
来源
AGRO FOOD INDUSTRY HI-TECH | 2017年 / 28卷 / 03期
关键词
Cloud computing; parallelization; association rules; clustering algorithm;
D O I
暂无
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The key of data mining algorithm in cloud computing environment is association rules and cluster analysis. The Apriori algorithm and K-means algorithm which are widely used nowadays have many problems such as long scanning time and large memory consumption. Therefore, a parallelized design scheme was proposed in this paper, which improved Apriori algorithm and K-means algorithm and was practiced by utilizing Hadoop platform, the feasibility of parallel project in massive data processing was discussed. The results showed that this proposed method could reduce the computational load of single node, reduce the computing time and improve the efficiency of the algorithm by repeating the calculation work on each node.
引用
收藏
页码:1055 / 1060
页数:6
相关论文
共 50 条