The Application of Big Data Mining Prediction Based on Improved K-Means Algorithm

被引:0
|
作者
Qiao, Yuchen [1 ]
Li, Yunlu [1 ]
Lv, Xiaotian [1 ]
机构
[1] Wuhan Univ Technol, Sch Automat, Wuhan, Peoples R China
关键词
K-Means algorithm; Various attributes; Data mining; Principal component analysis; Annual income forecast;
D O I
10.1109/yac.2019.8787670
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In order to solve the problem of low efficiency of K-Means algorithm in processing the data mining prediction problem of big data with more attributes, an annual income prediction method of residents based on improved K-Means algorithm is proposed. The improved K-Means algorithm combines the principal component analysis method with the traditional K-Means algorithm. After reducing the dimensionality of various data attributes, the data are classified with K-Means algorithm. The research makes use of 1994 U.S. census database and conducts a contrastive analysis of the two algorithms. The results show that the prediction accuracy has been significantly improved by 13.3313%, from 53.1016% to 66.4329% It is clear the improved algorithm can effectively improve the accuracy of clustering and annual income prediction.
引用
收藏
页码:348 / 351
页数:4
相关论文
共 50 条
  • [1] Application of an improved K-Means algorithm in data mining
    Wang, JM
    Guo, H
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT, VOLS 1 AND 2: INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT IN THE GLOBAL ECONOMY, 2005, : 416 - 419
  • [2] An improved K-means algorithm for big data
    Moodi, Fatemeh
    Saadatfar, Hamid
    [J]. IET SOFTWARE, 2022, 16 (01) : 48 - 59
  • [3] A K-Means Algorithm Application on Big Data
    Eren, Beste
    Karabulut, Ezgi Cilga
    Alptekin, S. Emre
    Alptekin, Gulfem Isiklar
    [J]. WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, WCECS 2015, VOL II, 2015, : 814 - 818
  • [4] ANALYSIS AND APPLICATION OF BIG DATA FEATURE EXTRACTION BASED ON IMPROVED K-MEANS ALGORITHM
    Yang, Wenjuan
    [J]. SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2024, 25 (01): : 137 - 145
  • [5] Research on parallel association rule mining of big data based on an improved K-means clustering algorithm
    Hao, Li
    Wang, Tuanbu
    Guo, Chaoping
    [J]. INTERNATIONAL JOURNAL OF AUTONOMOUS AND ADAPTIVE COMMUNICATIONS SYSTEMS, 2023, 16 (03) : 233 - 247
  • [6] Improved K-Means Clustering Algorithm for Big Data Mining under Hadoop Parallel Framework
    Lu, Weijia
    [J]. JOURNAL OF GRID COMPUTING, 2020, 18 (02) : 239 - 250
  • [7] Improved K-Means Clustering Algorithm for Big Data Mining under Hadoop Parallel Framework
    Weijia Lu
    [J]. Journal of Grid Computing, 2020, 18 : 239 - 250
  • [8] Improvement of the Fast Clustering Algorithm Improved by K-Means in the Big Data
    Xie, Ting
    Liu, Ruihua
    Wei, Zhengyuan
    [J]. APPLIED MATHEMATICS AND NONLINEAR SCIENCES, 2020, 5 (01) : 1 - 10
  • [9] SMK-means: An Improved Mini Batch K-means Algorithm Based on Mapreduce with Big Data
    Xiao, Bo
    Wang, Zhen
    Liu, Qi
    Liu, Xiaodong
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2018, 56 (03): : 365 - 379
  • [10] The fast clustering algorithm for the big data based on K-means
    Xie, Ting
    Zhang, Taiping
    [J]. INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2020, 18 (06)