A novel autoencoder approach to feature extraction with linear separability for high-dimensional data

Cited by: 0
Authors
Zheng J. [1 ]
Qu H. [1 ,2 ]
Li Z. [1 ]
Li L. [1 ]
Tang X. [2 ]
Guo F. [2 ]
Affiliations
[1] College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing
[2] College of Automation, Chongqing University of Posts and Telecommunications, Chongqing
Funding
National Natural Science Foundation of China;
Keywords
Autoencoder; Distance metric; Feature extraction;
DOI
10.7717/PEERJ-CS.1061
Abstract
Feature extraction relies on sufficient information from the input data; however, data distributed in a high-dimensional space are too sparse to provide that information. High dimensionality also makes it difficult to search for features scattered across subspaces. Feature extraction from high-dimensional data is therefore a difficult task. To address this issue, this article proposes a novel autoencoder method that uses a Mahalanobis distance metric with a rescaling transformation. The key idea is that applying the Mahalanobis distance metric with a rescaling transformation reduces the difference between the reconstructed distribution and the original distribution, which improves the autoencoder's feature extraction ability. Results show that the proposed approach outperforms state-of-the-art methods in both the accuracy of feature extraction and the linear separability of the extracted features. Our results indicate that distance metric-based methods are better suited than feature selection-based methods to extracting linearly separable features from high-dimensional data. In a high-dimensional space, evaluating feature similarity is relatively easier than evaluating feature importance, so distance metric methods, which evaluate feature similarity, have an advantage over feature selection methods, which assess feature importance; however, evaluating feature importance is more computationally efficient than evaluating feature similarity. © 2022 Zheng et al.
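The abstract does not give the loss used by the authors, so the following is only a minimal sketch of the general idea of measuring reconstruction error with a Mahalanobis distance instead of a plain Euclidean one. The function name `mahalanobis_reconstruction_loss`, the choice to estimate the covariance from the input batch, and the `eps` regularizer are all assumptions made for illustration; they are not taken from the paper.

```python
import numpy as np


def mahalanobis_reconstruction_loss(X, X_hat, eps=1e-6):
    """Mean squared Mahalanobis distance between rows of X and X_hat.

    Illustrative only: the covariance is estimated from X, and eps is
    added to its diagonal (an assumption) so that the inverse exists.
    """
    X = np.asarray(X, dtype=float)
    X_hat = np.asarray(X_hat, dtype=float)
    cov = np.cov(X, rowvar=False) + eps * np.eye(X.shape[1])
    cov_inv = np.linalg.inv(cov)
    diff = X - X_hat
    # d_i^2 = diff_i^T @ cov_inv @ diff_i, evaluated for every row at once
    d2 = np.einsum("ij,jk,ik->i", diff, cov_inv, diff)
    return float(d2.mean())


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic data whose features live on very different scales
    X = rng.normal(size=(200, 5)) * np.array([1.0, 10.0, 100.0, 0.1, 1.0])
    X_hat = X + rng.normal(scale=0.05, size=X.shape) * X.std(axis=0)
    print(mahalanobis_reconstruction_loss(X, X))      # perfect reconstruction
    print(mahalanobis_reconstruction_loss(X, X_hat))  # noisy reconstruction
```

Because the inverse covariance rescales each direction by its spread, an error of a given relative size contributes comparably whether it occurs in a large-scale or a small-scale feature, which a Euclidean loss does not guarantee.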
Related papers
50 in total
  • [1] A novel autoencoder approach to feature extraction with linear separability for high-dimensional data
    Zheng, Jian
    Qu, Hongchun
    Li, Zhaoni
    Li, Lin
    Tang, Xiaoming
    Guo, Fei
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [2] Overfitting in linear feature extraction for classification of high-dimensional image data
    Liu, Raymond
    Gillies, Duncan F.
    PATTERN RECOGNITION, 2016, 53 : 73 - 86
  • [3] Feature extraction of linear separability using robust autoencoder with distance metric
    Wei, Pingping
    Zhang, Xin
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (05) : 7589 - 7598
  • [4] Wavelet feature extraction for high-dimensional microarray data
    Liu, Yihui
    NEUROCOMPUTING, 2009, 72 (4-6) : 985 - 990
  • [5] BOSO: A novel feature selection algorithm for linear regression with high-dimensional data
    Valcarcel, Luis J.
    San Jose-Eneriz, Edurne L.
    Cendoya, Xabier
    Rubio, Angel L.
    Agirre, Xabier
    Prosper, Felipe L.
    Planes, Francisco
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (05)
  • [6] Unsupervised linear feature-extraction methods and their effects in the classification of high-dimensional data
    Jimenez-Rodriguez, Luis O.
    Arzuaga-Cruz, Emmanuel
    Velez-Reyes, Miguel
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2007, 45 (02) : 469 - 483
  • [7] A Fast Nonnegative Autoencoder-Based Approach to Latent Feature Analysis on High-Dimensional and Incomplete Data
    Bi, Fanghui
    He, Tiantian
    Luo, Xin
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (03) : 733 - 746
  • [8] Locality sensitive batch feature extraction for high-dimensional data
    Ding, Jie
    Wen, Changyun
    Li, Guoqi
    Chua, Chin Seng
    NEUROCOMPUTING, 2016, 171 : 664 - 672
  • [9] Feature extraction in remote sensing high-dimensional image data
    Zortea, Maciel
    Haertel, Victor
    Clarke, Robin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2007, 4 (01) : 107 - 111
  • [10] Feature extraction and uncorrelated discriminant analysis for high-dimensional data
    Yang, Wen-Hui
    Dai, Dao-Qing
    Yan, Hong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (05) : 601 - 614