An Outlier Detection Algorithm Based on Probability Density Clustering

被引:1
|
作者
Wang, Wei [1 ]
Ren, Yongjian [2 ]
Zhou, Renjie [3 ]
Zhang, Jilin [3 ]
机构
[1] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou, Peoples R China
[2] Hangzhou Dianzi Univ, Hangzhou, Peoples R China
[3] Hangzhou Dianzi Univ, Comp & Software Sch, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Outlier Detection; Ratio of Average Probability Density to Probability Density; Sparse Clusters and Dense Clusters are Close Together; DISTANCE-BASED OUTLIERS; DATA STREAM; EFFICIENT; CLASSIFICATION;
D O I
10.4018/IJDWM.333901
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Outlier detection for batch and streaming data is an important branch of data mining. However, there are shortcomings for existing algorithms. For batch data, the outlier detection algorithm, only labeling a few data points, is not accurate enough because it uses histogram strategy to generate feature vectors. For streaming data, the outlier detection algorithms are sensitive to data distance, resulting in low accuracy when sparse clusters and dense clusters are close to each other. Moreover, they require tuning of parameters, which takes a lot of time. With this, the manuscript per the authors propose a new outlier detection algorithm, called PDC which use probability density to generate feature vectors to train a lightweight machine learning model that is finally applied to detect outliers. PDC takes advantages of accuracy and insensitivity-to-data-distance of probability density, so it can overcome the aforementioned drawbacks.
引用
收藏
页码:22 / 22
页数:1
相关论文
共 50 条
  • [11] A Spectral Clustering Algorithm for Outlier Detection
    Yang, Peng
    Huang, Biao
    [J]. 2008 INTERNATIONAL SEMINAR ON FUTURE INFORMATION TECHNOLOGY AND MANAGEMENT ENGINEERING, PROCEEDINGS, 2008, : 33 - 36
  • [12] Outlier Detection in Dairy Cows Estrus Based on Density Clustering
    Liu Jindi
    Zhu Huaji
    [J]. PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 2291 - 2294
  • [13] RDOF: An outlier detection algorithm based on relative density
    Wahid, Abdul
    Rao, Annavarapu Chandra Sekhara
    [J]. EXPERT SYSTEMS, 2022, 39 (02)
  • [14] Density-based trajectory outlier detection algorithm
    Zhipeng Liu
    Dechang Pi
    Jinfeng Jiang
    [J]. Journal of Systems Engineering and Electronics, 2013, 24 (02) : 335 - 340
  • [15] Relative Density-Based Outlier Detection Algorithm
    Ning, Jin
    Chen, Leiting
    Chen, Junwei
    [J]. PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 227 - 231
  • [16] Density-based trajectory outlier detection algorithm
    Liu, Zhipeng
    Pi, Dechang
    Jiang, Jinfeng
    [J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2013, 24 (02) : 335 - 340
  • [17] An Outlier Detection Algorithm for Data Streams Based on Fuzzy Clustering
    Su, Xiaoke
    Qin, Yuming
    Wan, Renxia
    [J]. PROGRESS IN INTELLIGENCE COMPUTATION AND APPLICATIONS, 2008, : 109 - 112
  • [18] An Outlier Detection Algorithm in Wireless Sensor Network Based on Clustering
    Niu, Kun
    Zhao, Fang
    Qiao, Xiuquan
    [J]. 2013 15TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT), 2013, : 433 - 437
  • [19] Clustering for probability density functions based on Genetic Algorithm
    Tai, V. V.
    Thao, N. T.
    Ha, C. N.
    [J]. APPLIED MATHEMATICS IN ENGINEERING AND RELIABILITY, 2016, : 51 - 57
  • [20] Multi-radius Density Clustering Algorithm Based on Outlier Factor
    Ye, Zonglin
    Cao, Hui
    Jia, Lixin
    Zhang, Yanbin
    Si, Gangquan
    [J]. MECHANICAL SCIENCE AND ENGINEERING IV, 2014, 472 : 427 - 431