Unsupervised Outlier detection algorithm based on k-NN and fuzzy logic

被引:0
|
作者
Renan Velazquez-Gonzalez, J. [1 ]
Peregrina-Barreto, Hayde [1 ]
Fco Martinez-Trinidad, Jose [1 ]
机构
[1] Inst Nacl Astrofis Opt & Electr, Puebla, Mexico
关键词
Outlier detection; Fuzzy logic; k-Nearest Neighbor;
D O I
10.1109/ropec48299.2019.9057029
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Given a set of observations an outlier is a measurement that differs significantly from other observations. In a real application, what is sought is to eliminate them since their processing implies statistical errors. Although there are several works that have addressed the outlier detection challenge, in recent works, efforts have been focused to unsupervised scenario because it does not require any a priori knowledge of data distributions and is more attached to reality. Unfortunately, unsupervised approaches have limitations under complex datasets. In order to solve this problem, we propose the use of K-NN rule and fuzzy logic for outlier detection. First, the proposed algorithm is evaluated by using synthetic data; after, the Harvard Unsupervised Anomaly Detection Benchmark Dataset, which consists of several complex data structures based in real-world applications, is used. In comparison with the current works, our algorithm outperforms most previous works for the Harvard Breast cancer dataset dataset (ROC score equal to 0.9980) while for the Harvard Pen Global dataset our algorithm achieves relatively higher accuracy (more accurate than some previous works) and similar results than most accurate algorithms in the current literature.
引用
下载
收藏
页数:6
相关论文
共 50 条
  • [1] An optimally weighted fuzzy k-NN algorithm
    Pham, TD
    PATTERN RECOGNITION AND DATA MINING, PT 1, PROCEEDINGS, 2005, 3686 : 239 - 247
  • [2] Fuzzy k-NN applied to moulds detection
    Kuske, M
    Rubio, R
    Romain, AC
    Nicolas, J
    Marco, S
    SENSORS AND ACTUATORS B-CHEMICAL, 2005, 106 (01) : 52 - 60
  • [3] Using Genetic Algorithm to Improve Fuzzy k-NN
    Zhang Juan
    Niu Yi
    He Wenbin
    2008 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, VOLS 1 AND 2, PROCEEDINGS, 2008, : 475 - +
  • [4] Two manufacturing applications of the fuzzy K-NN algorithm
    Liao, TW
    Li, DM
    FUZZY SETS AND SYSTEMS, 1997, 92 (03) : 289 - 303
  • [5] Fuzzy k-NN SVM
    Cheng, Hui-Chuan
    Yang, Chan-Yun
    Jan, Gene Eu
    Chen, Angela Shin-yih
    2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 1227 - 1232
  • [6] Research of liver cancer detection based on improved K-NN algorithm
    Liu, Jianhua
    Wang, Jianwei
    Bu, Wenjuan
    Journal of Chemical and Pharmaceutical Research, 2014, 6 (05) : 360 - 367
  • [7] K-NN algorithm based on neural similarity
    Lazzerini, B
    Marcelloni, F
    2002 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE SYSTEMS, PROCEEDINGS, 2002, : 67 - 70
  • [8] Wearable Method for Fall Detection Based on Kalman Filter and k-NN Algorithm
    He J.
    Zhou M.
    Wang X.
    Wang, Xiaoyi (wxy@mail.bjut.edu.cn), 1600, Science Press (39): : 2627 - 2634
  • [9] A FUZZY VERSION OF THE K-NN METHOD
    KISSIOV, VT
    HADJITODOROV, ST
    FUZZY SETS AND SYSTEMS, 1992, 49 (03) : 323 - 329
  • [10] Fraud Detection in Shipping Industry using K-NN Algorithm
    Subramaniam, Ganesan
    Mahmoud, Moamin A.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (04) : 466 - 475