An Outlier Detection Algorithm Based on Probability Density Clustering

被引:1
|
作者
Wang, Wei [1 ]
Ren, Yongjian [2 ]
Zhou, Renjie [3 ]
Zhang, Jilin [3 ]
机构
[1] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou, Peoples R China
[2] Hangzhou Dianzi Univ, Hangzhou, Peoples R China
[3] Hangzhou Dianzi Univ, Comp & Software Sch, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Outlier Detection; Ratio of Average Probability Density to Probability Density; Sparse Clusters and Dense Clusters are Close Together; DISTANCE-BASED OUTLIERS; DATA STREAM; EFFICIENT; CLASSIFICATION;
D O I
10.4018/IJDWM.333901
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Outlier detection for batch and streaming data is an important branch of data mining. However, there are shortcomings for existing algorithms. For batch data, the outlier detection algorithm, only labeling a few data points, is not accurate enough because it uses histogram strategy to generate feature vectors. For streaming data, the outlier detection algorithms are sensitive to data distance, resulting in low accuracy when sparse clusters and dense clusters are close to each other. Moreover, they require tuning of parameters, which takes a lot of time. With this, the manuscript per the authors propose a new outlier detection algorithm, called PDC which use probability density to generate feature vectors to train a lightweight machine learning model that is finally applied to detect outliers. PDC takes advantages of accuracy and insensitivity-to-data-distance of probability density, so it can overcome the aforementioned drawbacks.
引用
收藏
页码:22 / 22
页数:1
相关论文
共 50 条
  • [1] Outlier detection algorithm based on fast density peak clustering outlier factor
    Zhang, Zhongping
    Li, Sen
    Liu, Weixiong
    Liu, Shuxia
    [J]. Tongxin Xuebao/Journal on Communications, 2022, 43 (10): : 186 - 195
  • [2] A New Outlier Detection Algorithm Based on Fast Density Peak Clustering Outlier Factor
    Zhang, ZhongPing
    Li, Sen
    Liu, WeiXiong
    Wang, Ying
    Li, Daisy Xin
    [J]. INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2023, 19 (02)
  • [3] A Hybrid Outlier Detection Algorithm Based On Partitioning Clustering And Density Measures
    Rizk, Hamada
    Elgokhy, Sherin
    Sarhan, Amany
    [J]. 2015 TENTH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS (ICCES), 2015, : 175 - 181
  • [4] An Outlier Detection Algorithm Based on Spectral Clustering
    Yang, Peng
    Huang, Biao
    [J]. PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 485 - 488
  • [5] An Effective Algorithm of Outlier Detection Based on Clustering
    Xia, Qingsong
    Xing, Changzheng
    Li, Na
    [J]. INTERNET OF THINGS-BK, 2012, 312 : 346 - 351
  • [6] Outlier Detection Algorithm Based on Iterative Clustering
    古平
    罗辛
    杨瑞龙
    张程
    [J]. Journal of Donghua University(English Edition), 2015, 32 (04) : 554 - 558
  • [7] Clustering Algorithm Based on Outlier Detection for Anomaly Intrusion Detection
    Yin, Shang-Nan
    Kang, Ho-Seok
    Kim, Sung-Ryul
    [J]. JOURNAL OF INTERNET TECHNOLOGY, 2016, 17 (02): : 291 - 299
  • [8] Unifying Density-Based Clustering and Outlier Detection
    Tao, Yunxin
    Pi, Dechang
    [J]. WKDD: 2009 SECOND INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, : 644 - 647
  • [9] Trajectory outlier detection based on DBSCAN clustering algorithm
    [J]. 1600, Chinese Society of Astronautics (46):
  • [10] An Outlier Detection Algorithm Based on Arbitrary Shape Clustering
    Su, Xiaoke
    Lan, Yang
    Wan, Renxia
    Qin, Yuming
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2009, 5678 : 627 - +