Unsupervised Outlier detection algorithm based on k-NN and fuzzy logic

被引:0
|
作者
Renan Velazquez-Gonzalez, J. [1 ]
Peregrina-Barreto, Hayde [1 ]
Fco Martinez-Trinidad, Jose [1 ]
机构
[1] Inst Nacl Astrofis Opt & Electr, Puebla, Mexico
关键词
Outlier detection; Fuzzy logic; k-Nearest Neighbor;
D O I
10.1109/ropec48299.2019.9057029
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Given a set of observations an outlier is a measurement that differs significantly from other observations. In a real application, what is sought is to eliminate them since their processing implies statistical errors. Although there are several works that have addressed the outlier detection challenge, in recent works, efforts have been focused to unsupervised scenario because it does not require any a priori knowledge of data distributions and is more attached to reality. Unfortunately, unsupervised approaches have limitations under complex datasets. In order to solve this problem, we propose the use of K-NN rule and fuzzy logic for outlier detection. First, the proposed algorithm is evaluated by using synthetic data; after, the Harvard Unsupervised Anomaly Detection Benchmark Dataset, which consists of several complex data structures based in real-world applications, is used. In comparison with the current works, our algorithm outperforms most previous works for the Harvard Breast cancer dataset dataset (ROC score equal to 0.9980) while for the Harvard Pen Global dataset our algorithm achieves relatively higher accuracy (more accurate than some previous works) and similar results than most accurate algorithms in the current literature.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] A hybrid method based on artificial immune system and fuzzy k-NN algorithm for diagnosis of heart valve diseases
    Sengur, Abdulkadir
    Turkoglu, Ibrahim
    EXPERT SYSTEMS WITH APPLICATIONS, 2008, 35 (03) : 1011 - 1020
  • [42] An improved k-NN algorithm for localization in multipath environments
    Yang Zhao
    Kaihua Liu
    Yongtao Ma
    Zhuo Li
    EURASIP Journal on Wireless Communications and Networking, 2014
  • [43] GPU based Cloud system for high-performance arrhythmia detection with parallel k-NN algorithm
    Jun, Tae Joon
    Park, Hyun Ji
    Yoo, Hyuk
    Kim, Young-Hak
    Kim, Daeyoung
    2016 38TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2016, : 5327 - 5330
  • [44] Implementation of K-NN Based on Histogram at Image Recognition for Pornography Detection
    Nuraisha, Safira
    Pratama, Fandy Indra
    Budianita, Avira
    Soeleman, M. Arief
    2017 INTERNATIONAL SEMINAR ON APPLICATION FOR TECHNOLOGY OF INFORMATION AND COMMUNICATION (ISEMANTIC), 2017, : 5 - 10
  • [45] A Novel Unsupervised 2-Stage k-NN Re-ranking Algorithm for Image Retrieval
    Li, Dawei
    Chuah, Mooi Choo
    2015 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2015, : 160 - 165
  • [46] A Framework for a Decision Tree Learning Algorithm with K-NN
    Kurematsu, Masaki
    Hakura, Jun
    Fujita, Hamido
    INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES, SOMET 2014, 2015, 513 : 39 - 51
  • [47] A modification of the LAESA algorithm for approximated k-NN classification
    Moreno-Seco, F
    Micó, L
    Oncina, J
    PATTERN RECOGNITION LETTERS, 2003, 24 (1-3) : 47 - 53
  • [48] Accelerating k-NN Algorithm with Hybrid MPI and OpenSHMEM
    Lin, Jian
    Hamidouche, Khaled
    Zhang, Jie
    Lu, Xiaoyi
    Vishnu, Abhinav
    Panda, Dhabaleswar
    OPENSHMEM AND RELATED TECHNOLOGIES: EXPERIENCES, IMPLEMENTATIONS, AND TECHNOLOGIES, OPENSHMEM 2015, 2015, 9397 : 164 - 177
  • [49] <bold>AN OPTIMIZATION ALGORITHM OF K-NN CLASSIFICATION</bold>
    Zhan, Yan
    Chen, Hao
    Zhang, Guo-Chun
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 2246 - +
  • [50] An improved k-NN algorithm for localization in multipath environments
    Zhao, Yang
    Liu, Kaihua
    Ma, Yongtao
    Li, Zhuo
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2014, : 1 - 10