Outlier mining algorithm based on data-partitioning and density-grid

被引:0
|
作者
Xing, Chang Zheng [1 ]
Tang, Cheng Long [1 ]
Wei, Ke [1 ]
机构
[1] Liaoning Tech Univ, Sch Elect & Informat Engn, Huludao, Peoples R China
关键词
data mining; outlier data; density-grid; data partitioning; cell; micro-cell; cell dimension tree(CD-Tree);
D O I
10.1109/ICCECT.2012.34
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Existing outlier mining algorithms such as FOMAUC are based on density-grid. These algorithms have the problems of inefficiency and bad-adaptability for various data sets, so this paper proposes an outlier mining algorithm based on data partitioning and grid-density. Firstly, the technology of data partitioning was applied. Secondly, the nonoutliers were filtered out by cell and the temporary results were saved. Thirdly, the improved CD-Tree was created to maintain the spatial information of the reserved data. After that, the nonoutliers were filtered out by micro-cell and were operated efficiently through two optimization strategies. Finally, followed by mining by data point the resulting outlier set was obtained. Theoretical analysis and the experimental results show that this method is feasible and effective, and that has better scalability for dealing with massive and high dimensional data.
引用
收藏
页码:880 / 884
页数:5
相关论文
共 50 条
  • [1] A Clustering Algorithm Based on Density-Grid for Stream Data
    Zhang, Dandan
    Tian, Hui
    Sang, Yingpeng
    Li, Yidong
    Wu, Yanbo
    Wu, Jun
    Shen, Hong
    [J]. 2012 13TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS, AND TECHNOLOGIES (PDCAT 2012), 2012, : 398 - 403
  • [2] A Density-Grid Based Clustering Algorithm on Data Stream Using Resilient Distributed Datasets
    Zhang, Yuan
    Zhang, Jiongmin
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2016, 2016, 9673 : 316 - 322
  • [3] A Distributed Density-Grid Clustering Algorithm for Multi-Dimensional Data
    Brown, Daniel
    Shi, Yong
    [J]. 2020 10TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2020, : 1 - 7
  • [4] A Fast Density-Grid Based Clustering Method
    Brown, Daniel
    Japa, Arialdis
    Shi, Yong
    [J]. 2019 IEEE 9TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2019, : 48 - 54
  • [5] Stacked Denoising Autoencoder With Density-Grid Based Clustering Method for Detecting Outlier of Wind Turbine Components
    Sun, Zexian
    Sun, Hexu
    [J]. IEEE ACCESS, 2019, 7 : 13078 - 13091
  • [6] Sub-Grid Partitioning Algorithm for Distributed Outlier Detection on Big Data
    Sakr, Mohamed
    Atwa, Walid
    Keshk, Arabi
    [J]. PROCEEDINGS OF 2018 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2018, : 252 - 257
  • [7] Big Data Outlier Detection Algorithm Based on Grid
    Guo Wei-Wei
    Liu Feng
    [J]. 2018 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION (ICICTA 2018), 2018, : 274 - 277
  • [8] A Data Stream Outlier Detection Algorithm Based on Grid
    Yu Xiang
    Lei Guohua
    Xu Xiandong
    Lin Liandong
    [J]. 2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 4136 - 4141
  • [9] A data mining algorithm based on grid
    Zang, XB
    Li, XF
    Zhao, K
    Guan, X
    [J]. GRID AND COOPERATIVE COMPUTING, PT 2, 2004, 3033 : 807 - 810
  • [10] A Novel Data Purification Algorithm Based On Outlier Mining
    Dong, Jianfeng
    Wang, Xiaofeng
    Hu, Feng
    Xiao, Liyan
    [J]. HIS 2009: 2009 NINTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, VOL 3, PROCEEDINGS, 2009, : 95 - +