RDPOD: an unsupervised approach for outlier detection

Cited by: 4
Authors
Abhaya, Abhaya [1]
Patra, Bidyut Kr [1]
Affiliations
[1] Natl Inst Technol Rourkela, Dept Comp Sci & Engn, Rourkela 769008, Odisha, India
Source
NEURAL COMPUTING & APPLICATIONS | 2022 / Vol. 34 / Issue 02
Keywords
Outlier detection; Local outlier factor (LOF); Local distance-based outlier factor (LDOF); Relative density-based factor (RDOS); Natural outlier factor (NOF); Symmetric neighborhood (INFLO); Density peaks clustering; NEAREST NEIGHBORS; DENSITY; DISTANCE;
DOI
10.1007/s00521-021-06432-6
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Outliers are data points that deviate significantly from the majority of the data. Finding outliers is an important task in various applications, especially in data mining. Unsupervised techniques are more popular than supervised techniques for mining outliers in a dataset. Various unsupervised approaches have been proposed over the last decades. Clustering-based, distance-based, and density-based approaches have been found to be successful at detecting outlier points. However, the main focus of clustering-based methods is identifying the clustering structure. Many distance-based and density-based techniques are not suitable for datasets of varying density, and they are also very sensitive to their parameter (the number of nearest neighbors, k). In this paper, we propose a hybrid approach named RDPOD, which utilizes distance-based and density-based clustering approaches efficiently to identify the density of each point correctly. We obtain the local density and relative distance of each data instance. From this density and distance information, we identify outlier points. Experimental results with real-world datasets show that our proposed approach outperforms the popular techniques LOF, LDOF, and symmetric neighborhood, as well as the recently introduced approaches NOF and RDOS.
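The abstract does not give RDPOD's formulas, so the following is only a rough illustration of the general idea of combining a local density with a relative distance, in the style of density peaks clustering. It is not the authors' algorithm. The function name `density_distance_scores`, the kNN-based density estimate, the parameter `k`, and the score `delta / rho` are all assumptions made for this sketch.

```python
import numpy as np


def density_distance_scores(X, k=5):
    """Illustrative outlier score from local density and relative distance.

    Assumptions (not taken from the paper): density is the inverse of the
    mean distance to the k nearest neighbours, and the score is the ratio
    relative distance / density.
    """
    X = np.asarray(X, dtype=float)
    n = X.shape[0]

    # Pairwise Euclidean distance matrix.
    diff = X[:, None, :] - X[None, :, :]
    D = np.sqrt((diff ** 2).sum(axis=-1))

    # Local density: inverse mean distance to the k nearest neighbours.
    # Column 0 of the sorted rows is the zero distance to the point itself.
    knn_mean = np.sort(D, axis=1)[:, 1:k + 1].mean(axis=1)
    rho = 1.0 / (knn_mean + 1e-12)

    # Relative distance: distance to the nearest point of higher density.
    # The densest point has no such neighbour and gets its largest distance.
    delta = np.empty(n)
    for i in range(n):
        higher = rho > rho[i]
        delta[i] = D[i, higher].min() if higher.any() else D[i].max()

    # Low density combined with a large relative distance gives a high score.
    score = delta / rho
    return rho, delta, score


# Synthetic example: one tight cluster plus a single far-away point.
rng = np.random.default_rng(0)
cluster = rng.normal(loc=0.0, scale=0.5, size=(50, 2))
X = np.vstack([cluster, [[8.0, 8.0]]])

rho, delta, score = density_distance_scores(X, k=5)
top = int(np.argmax(score))
print("index with the highest score:", top)
```

In this synthetic example the isolated point at index 50 has the lowest density and a large relative distance, so it receives the highest score. How RDPOD actually defines and combines the two quantities is described in the paper itself.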
Pages: 1065-1077
Page count: 13