Unsupervised Outlier Detection: A Meta-Learning Algorithm Based on Feature Selection

被引:7
|
作者
Papastefanopoulos, Vasilis [1 ]
Linardatos, Pantelis [1 ]
Kotsiantis, Sotiris [1 ]
机构
[1] Univ Patras, Dept Math, Patras 26504, Greece
关键词
machine learning; data science; unsupervised outlier; detection; meta-learning; feature selection; ensemble-learning;
D O I
10.3390/electronics10182236
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Outlier detection refers to the problem of the identification and, where appropriate, the elimination of anomalous observations from data. Such anomalous observations can emerge due to a variety of reasons, including human or mechanical errors, fraudulent behaviour as well as environmental or systematic changes, occurring either naturally or purposefully. The accurate and timely detection of deviant observations allows for the early identification of potentially extensive problems, such as fraud or system failures, before they escalate. Several unsupervised outlier detection methods have been developed; however, there is no single best algorithm or family of algorithms, as typically each relies on a measure of 'outlierness' such as density or distance, ignoring other measures. To add to that, in an unsupervised setting, the absence of ground-truth labels makes finding a single best algorithm an impossible feat even for a single given dataset. In this study, a new meta-learning algorithm for unsupervised outlier detection is introduced in order to mitigate this problem. The proposed algorithm, in a fully unsupervised manner, attempts not only to combine the best of many worlds from the existing techniques through ensemble voting but also mitigate any undesired shortcomings by employing an unsupervised feature selection strategy in order to identify the most informative algorithms for a given dataset. The proposed methodology was evaluated extensively through experimentation, where it was benchmarked and compared against a wide range of commonly-used techniques for outlier detection. Results obtained using a variety of widely accepted datasets demonstrated its usefulness and its state-of-the-art results as it topped the Friedman ranking test for both the area under receiver operating characteristic (ROC) curve and precision metrics when averaged over five independent trials.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] A PCA Based Unsupervised Feature Selection Algorithm
    Luo, Yihui
    Xiong, Shuchu
    Wang, Sichuan
    [J]. SECOND INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING: WGEC 2008, PROCEEDINGS, 2008, : 299 - 302
  • [22] Unsupervised Feature Selection for Outlier Detection in Categorical Data using Mutual Information
    Suri, N. N. R. Ranga
    Murty, M. Narasimha
    Athithan, G.
    [J]. 2012 12TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS), 2012, : 253 - 258
  • [23] Unsupervised Feature Selection for Outlier Detection on Streaming Data to Enhance Network Security
    Heigl, Michael
    Weigelt, Enrico
    Fiala, Dalibor
    Schramm, Martin
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (24):
  • [24] Meta-learning methodology based on meta-unsupervised algorithm for meta-model selection to solve few-shot base-tasks
    Eduardo Rivas-Posada
    Mario I. Chacon-Murguia
    [J]. Neural Computing and Applications, 2024, 36 : 9073 - 9094
  • [25] Feature selection method based on unsupervised learning
    Zhang Li
    Sun Gang
    Guo Jun
    [J]. PROCEEDINGS OF 2004 CHINESE CONTROL AND DECISION CONFERENCE, 2004, : 218 - 220
  • [26] Meta-learning Based Evolutionary Clustering Algorithm
    Tomp, Dmitry
    Muravyov, Sergey
    Filchenkov, Andrey
    Parfenov, Vladimir
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2019, PT I, 2019, 11871 : 502 - 513
  • [27] Critical feature identification and meta-learning method for islanding detection
    Zhang, Peichao
    Tan, Xiaofeng
    Yang, Peixin
    [J]. Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2014, 38 (18): : 72 - 78
  • [28] Deepfake detection using deep feature stacking and meta-learning
    Naskar, Gourab
    Mohiuddin, Sk
    Malakar, Samir
    Cuevas, Erik
    Sarkar, Ram
    [J]. HELIYON, 2024, 10 (04)
  • [29] Reducing cognitive overload by meta-learning assisted algorithm selection
    Fan, Lisa
    Lei, Minxiao
    [J]. PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, VOLS 1 AND 2, 2006, : 120 - 125
  • [30] CMVAE: Causal Meta VAE for Unsupervised Meta-Learning
    Qi, Guodong
    Yu, Huimin
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9480 - 9488