An Outlier Detection Algorithm Based on Probability Density Clustering

被引:1
|
作者
Wang, Wei [1 ]
Ren, Yongjian [2 ]
Zhou, Renjie [3 ]
Zhang, Jilin [3 ]
机构
[1] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou, Peoples R China
[2] Hangzhou Dianzi Univ, Hangzhou, Peoples R China
[3] Hangzhou Dianzi Univ, Comp & Software Sch, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Outlier Detection; Ratio of Average Probability Density to Probability Density; Sparse Clusters and Dense Clusters are Close Together; DISTANCE-BASED OUTLIERS; DATA STREAM; EFFICIENT; CLASSIFICATION;
D O I
10.4018/IJDWM.333901
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Outlier detection for batch and streaming data is an important branch of data mining. However, there are shortcomings for existing algorithms. For batch data, the outlier detection algorithm, only labeling a few data points, is not accurate enough because it uses histogram strategy to generate feature vectors. For streaming data, the outlier detection algorithms are sensitive to data distance, resulting in low accuracy when sparse clusters and dense clusters are close to each other. Moreover, they require tuning of parameters, which takes a lot of time. With this, the manuscript per the authors propose a new outlier detection algorithm, called PDC which use probability density to generate feature vectors to train a lightweight machine learning model that is finally applied to detect outliers. PDC takes advantages of accuracy and insensitivity-to-data-distance of probability density, so it can overcome the aforementioned drawbacks.
引用
收藏
页码:22 / 22
页数:1
相关论文
共 50 条
  • [41] Location algorithm of transfer stations based on density peak and outlier detection
    Yan Shao-hong
    Niu Jia-yang
    Chen Tai-long
    Liu Qiu-tong
    Yang Cen
    Cheng Jia-qing
    Fu Zhi-zhen
    Li Jie
    [J]. Applied Intelligence, 2022, 52 : 13520 - 13532
  • [42] Outlier Detection Using a GPU-Based Parallel Algorithm: Quantum Clustering
    Liu, Ding
    Wang, Zhe
    Li, Hui
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2024, 33 (04)
  • [43] A New Outlier Detection Algorithm Based on Kernel Density Estimation for ITS
    Xu, Yiwen
    Xu, Ningbin
    Feng, Xinxin
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS (ITHINGS) AND IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) AND IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) AND IEEE SMART DATA (SMARTDATA), 2016, : 258 - 262
  • [44] Density-Distance Outlier Detection Algorithm Based on Natural Neighborhood
    Zhang, Jiaxuan
    Yang, Youlong
    [J]. AXIOMS, 2023, 12 (05)
  • [45] A distributed density-based outlier detection algorithm on big data
    Mei, Lin
    Zhang, Fengli
    [J]. International Journal of Network Security, 2020, 22 (05): : 775 - 781
  • [46] Location algorithm of transfer stations based on density peak and outlier detection
    Yan Shao-hong
    Niu Jia-yang
    Chen Tai-long
    Liu Qiu-tong
    Yang Cen
    Cheng Jia-qing
    Fu Zhi-zhen
    Li Jie
    [J]. APPLIED INTELLIGENCE, 2022, 52 (12) : 13520 - 13532
  • [47] An Outlier Detection Algorithm based on KNN-kernel Density Estimation
    Wahid, Abdul
    Rao, Annavarapu Chandra Sekhara
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [48] An incremental outlier factor based clustering algorithm
    Zhou, YF
    Liu, QB
    Deng, S
    Yang, Q
    [J]. 2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 1358 - 1361
  • [49] Modified genetic algorithm-based clustering for probability density functions
    Tai Vo-Van
    Trung Nguyen-Thoi
    Trung Vo-Duy
    Vinh Ho-Huu
    Thao Nguyen-Trang
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2017, 87 (10) : 1964 - 1979
  • [50] A jackknife entropy-based clustering algorithm for probability density functions
    Chen, Jen-Hao
    Hung, Wen-Liang
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2021, 91 (05) : 861 - 875