Unsupervised Outlier detection algorithm based on k-NN and fuzzy logic

被引:0
|
作者
Renan Velazquez-Gonzalez, J. [1 ]
Peregrina-Barreto, Hayde [1 ]
Fco Martinez-Trinidad, Jose [1 ]
机构
[1] Inst Nacl Astrofis Opt & Electr, Puebla, Mexico
关键词
Outlier detection; Fuzzy logic; k-Nearest Neighbor;
D O I
10.1109/ropec48299.2019.9057029
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Given a set of observations an outlier is a measurement that differs significantly from other observations. In a real application, what is sought is to eliminate them since their processing implies statistical errors. Although there are several works that have addressed the outlier detection challenge, in recent works, efforts have been focused to unsupervised scenario because it does not require any a priori knowledge of data distributions and is more attached to reality. Unfortunately, unsupervised approaches have limitations under complex datasets. In order to solve this problem, we propose the use of K-NN rule and fuzzy logic for outlier detection. First, the proposed algorithm is evaluated by using synthetic data; after, the Harvard Unsupervised Anomaly Detection Benchmark Dataset, which consists of several complex data structures based in real-world applications, is used. In comparison with the current works, our algorithm outperforms most previous works for the Harvard Breast cancer dataset dataset (ROC score equal to 0.9980) while for the Harvard Pen Global dataset our algorithm achieves relatively higher accuracy (more accurate than some previous works) and similar results than most accurate algorithms in the current literature.
引用
下载
收藏
页数:6
相关论文
共 50 条
  • [21] Semantic-k-NN algorithm: An enhanced version of traditional k-NN algorithm
    Ali, Munwar
    Jung, Low Tang
    Abdel-Aty, Abdel-Haleem
    Abubakar, Mustapha Y.
    Elhoseny, Mohamed
    Ali, Irfan
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 151
  • [22] New techniques for efficiently k-NN algorithm for brain tumor detection
    Soobia Saeed
    Afnizanfaizal Abdullah
    N. Z. Jhanjhi
    Mehmood Naqvi
    Anand Nayyar
    Multimedia Tools and Applications, 2022, 81 : 18595 - 18616
  • [23] New techniques for efficiently k-NN algorithm for brain tumor detection
    Saeed, Soobia
    Abdullah, Afnizanfaizal
    Jhanjhi, N. Z.
    Naqvi, Mehmood
    Nayyar, Anand
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (13) : 18595 - 18616
  • [24] OUTLIER DETECTION AND FUZZY LOGIC ALGORITHM FOR PERFORMANCE EVALUATION PROBLEM
    Megala, N.
    PROCEEDINGS OF THE 2009 INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGY AND ENGINEERING, 2009, : 169 - 173
  • [25] A case based method to predict optimal k value for k-NN algorithm
    Yang Zhongguo
    Li Hongqi
    Zhu Liping
    Liu Qiang
    Ali, Sikandar
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2017, 33 (01) : 55 - 65
  • [26] Intrusion Detection System using Genetic Algorithm and K-NN Algorithm on Dos Attack
    Fauzi, Muhammad Akmal
    Hanuranto, Ahmad Tri
    Setianingsih, Casi
    PROCEEDINGS OF ICORIS 2020: 2020 THE 2ND INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEM (ICORIS), 2020, : 323 - 328
  • [27] Fast multistage algorithm for K-NN classifiers
    Soraluze, I
    Rodriguez, C
    Boto, F
    Cortes, A
    PROGRESS IN PATTERN RECOGNITION, SPEECH AND IMAGE ANALYSIS, 2003, 2905 : 448 - 455
  • [28] Improving k-NN by using fuzzy similarity functions
    Morell, C
    Bello, R
    Grau, R
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2004, 2004, 3315 : 708 - 716
  • [29] Protein subcellular location prediction using optimally weighted fuzzy k-NN algorithm
    Nasibov, Efendi
    Kandemir-Cavas, Cagin
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2008, 32 (06) : 448 - 451
  • [30] ML based modulation format identifier using K-NN algorithm
    Debnath, Suman
    Sinha, Nitish
    Bhowmik, Bishanka Brata
    MATERIALS TODAY-PROCEEDINGS, 2022, 65 : 2626 - 2630