Density-Distance Outlier Detection Algorithm Based on Natural Neighborhood

被引:2
|
作者
Zhang, Jiaxuan [1 ]
Yang, Youlong [1 ]
机构
[1] Xidian Univ, Sch Math & Stat, Xian 710126, Peoples R China
基金
中国国家自然科学基金;
关键词
outlier detection; natural neighbors; adaptive kernel density estimation; local density; relative distance; EFFICIENT;
D O I
10.3390/axioms12050425
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Outlier detection is of great significance in the domain of data mining. Its task is to find those target points that are not identical to most of the object generation mechanisms. The existing algorithms are mainly divided into density-based algorithms and distance-based algorithms. However, both approaches have some drawbacks. The former struggles to handle low-density modes, while the latter cannot detect local outliers. Moreover, the outlier detection algorithm is very sensitive to parameter settings. This paper proposes a new two-parameter outlier detection (TPOD) algorithm. The method proposed in this paper does not need to manually define the number of neighbors, and the introduction of relative distance can also solve the problem of low density and further accurately detect outliers. This is a combinatorial optimization problem. Firstly, the number of natural neighbors is iteratively calculated, and then the local density of the target object is calculated by adaptive kernel density estimation. Secondly, the relative distance of the target points is computed through natural neighbors. Finally, these two parameters are combined to obtain the outlier factor. This eliminates the influence of parameters that require users to determine the number of outliers themselves, namely, the top-n effect. Two synthetic datasets and 17 real datasets were used to test the effectiveness of this method; a comparison with another five algorithms is also provided. The AUC value and F1 score on multiple datasets are higher than other algorithms, indicating that outliers can be found accurately, which proves that the algorithm is effective.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Data outlier detection algorithm based on density difference of double radius
    Department of Computer Science and Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China
    不详
    不详
    [J]. Gaojishu Tongxin/Chinese High Technology Letters, 2008, 18 (04): : 350 - 354
  • [42] A non-parameter outlier detection algorithm based on Natural Neighbor
    Huang, Jinlong
    Zhu, Qingsheng
    Yang, Lijun
    Feng, Ji
    [J]. KNOWLEDGE-BASED SYSTEMS, 2016, 92 : 71 - 77
  • [43] Weighted natural neighborhood graph: an adaptive structure for clustering and outlier detection with no neighborhood parameter
    Qingsheng Zhu
    Ji Feng
    Jinlong Huang
    [J]. Cluster Computing, 2016, 19 : 1385 - 1397
  • [44] Weighted natural neighborhood graph: an adaptive structure for clustering and outlier detection with no neighborhood parameter
    Zhu, Qingsheng
    Feng, Ji
    Huang, Jinlong
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2016, 19 (03): : 1385 - 1397
  • [45] A Network Anomaly Detection Algorithm based on Natural Neighborhood Graph
    Liu, Renyu
    Zhu, Qingsheng
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [46] An outlier detection algorithm based on an integrated outlier factor
    Zhou, Hongfang
    Liu, Hongjiang
    Zhang, Yingjie
    Zhang, Yao
    [J]. INTELLIGENT DATA ANALYSIS, 2019, 23 (05) : 975 - 990
  • [47] A New Neighborhood-Based Outlier Detection Technique
    Gupta, Umang
    Bhattacharjee, Vandana
    Bishnu, Partha Sarathi
    [J]. PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON MICROELECTRONICS, COMPUTING AND COMMUNICATION SYSTEMS, MCCS 2018, 2019, 556 : 527 - 534
  • [48] Outlier Detection based on K-Neighborhood MST
    Zhu, Qingsheng
    Fan, Xiaogang
    Feng, Ji
    [J]. 2014 IEEE 15TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), 2014, : 718 - 724
  • [49] Outlier detection method based on improved distance
    School of Computer Science and Engineering, South China University of Technology, Guangzhou 510640, China
    [J]. Huanan Ligong Daxue Xuebao, 2008, 9 (25-30):
  • [50] Density Based Outlier Detection Technique
    Gupta, Raghav
    Pandey, Kavita
    [J]. INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 1, INDIA 2016, 2016, 433 : 51 - 58