A hybrid dimensionality reduction method for outlier detection in high-dimensional data

被引:1
|
作者
Meng, Guanglei [1 ]
Wang, Biao [1 ]
Wu, Yanming [1 ]
Zhou, Mingzhe [1 ]
Meng, Tiankuo [1 ]
机构
[1] Shenyang Aerosp Univ, Sch Automat, Shenyang 110136, Peoples R China
基金
美国国家科学基金会;
关键词
Outlier detection; Anomaly detection; Dimensionality reduction; High-dimensional data; Ensemble learning; FEATURE-EXTRACTION; ENSEMBLE; PCA;
D O I
10.1007/s13042-023-01859-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Outlier detection becomes challenging when data are featured by high-dimension. Using dimensionality reduction (DR) techniques to discard the irrelevant attributes is a straightforward solution. However, it appears to be rather difficult for single DR algorithm to discover all outliers, owing to the rarity, heterogeneity, and boundless nature of outliers. In this paper, we propose a hybrid DR method dedicated to outlier detection base on ensemble learning. Multiple algorithms with different specifications of parameters are used to generate accurate and diverse base detectors at the phase of ensemble generation. A two-stage combination function is used at the phase of ensemble combination. Both variance reduction and bias reduction are taken into account in our framework. More importantly, the high flexibility of the proposed detection framework implies that any outlier detection algorithm can be applicable. 15 high-dimensional data sets from KEEL repository and one image data set are used to validate the performance of our method. One semi-supervised and one unsupervised outlier detection algorithms are used in separate experiments. In spite of subtle differences, the advantage of our method has been approved by both experiments. Moreover, contributions of two ingredients of our method are also verified via two pairs of experimental comparisons.
引用
收藏
页码:3705 / 3718
页数:14
相关论文
共 50 条
  • [41] Distance-preserving projection of high-dimensional data for nonlinear dimensionality reduction
    Yang, L
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (09) : 1243 - 1246
  • [42] Comparing and Exploring High-Dimensional Data with Dimensionality Reduction Algorithms and Matrix Visualizations
    Cutura, Rene
    Aupetit, Michael
    Fekete, Jean-Daniel
    Sedlmair, Michael
    [J]. PROCEEDINGS OF THE WORKING CONFERENCE ON ADVANCED VISUAL INTERFACES AVI 2020, 2020,
  • [43] Semi-supervised dimensionality reduction for analyzing high-dimensional data with constraints
    Yan, Su
    Bouaziz, Sofien
    Lee, Dongwon
    Barlow, Jesse
    [J]. NEUROCOMPUTING, 2012, 76 (01) : 114 - 124
  • [44] IPMOD: An efficient outlier detection model for high-dimensional medical data streams
    Yang, Yun
    Fan, ChongJun
    Chen, Liang
    Xiong, HongLin
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 191
  • [45] Weighted Outlier Detection of High-Dimensional Categorical Data Using Feature Grouping
    Li, Junli
    Zhang, Jifu
    Pang, Ning
    Qin, Xiao
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11): : 4295 - 4308
  • [46] Computationally Efficient Outlier Detection for High-Dimensional Data Using the MDP Algorithm
    Tsagris, Michail
    Papadakis, Manos
    Alenazi, Abdulaziz
    Alzeley, Omar
    [J]. COMPUTATION, 2024, 12 (09)
  • [47] A hybrid feature selection method for high-dimensional data
    Taheri, Nooshin
    Nezamabadi-pour, Hossein
    [J]. 2014 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2014, : 141 - 145
  • [48] Projected outlier detection in high-dimensional mixed-attributes data set
    Ye, Mao
    Li, Xue
    Orlowska, Maria E.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) : 7104 - 7113
  • [49] An Unbiased Distance-Based Outlier Detection Approach for High-Dimensional Data
    Hoang Vu Nguyen
    Gopalkrishnan, Vivekanand
    Assent, Ira
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT I, 2011, 6587 : 138 - +
  • [50] Outlier Detection in the Framework of Dimensionality Reduction
    Ye, Qiang
    Zhi, Weifeng
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2015, 29 (04)