A Fast Nonnegative Autoencoder-Based Approach to Latent Feature Analysis on High-Dimensional and Incomplete Data

被引:5
|
作者
Bi, Fanghui [1 ,2 ]
He, Tiantian [3 ]
Luo, Xin [4 ]
机构
[1] Chinese Acad Sci, Chongqing Inst Green & Intelligent Technol, Chongqing 400714, Peoples R China
[2] Univ Chinese Acad Sci, Chongqing Sch, Chongqing 400714, Peoples R China
[3] ASTAR, Inst High Performance Comp, Singapore Inst Mfg Technol, Ctr Frontier AI Res, Singapore 699010, Singapore
[4] Southwest Univ, Coll Comp & Informat Sci, Chongqing 400715, Peoples R China
基金
中国国家自然科学基金;
关键词
Knowledge acquisition; data science; high-dimensional and incomplete data; neural network; fast nonnegative AutoEncoder; latent feature analysis; link prediction; network representation learning; MATRIX; FACTORIZATION; NETWORK;
D O I
10.1109/TSC.2023.3319713
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High-Dimensional and Incomplete (HDI) data are frequently encountered in various Big Data-related applications. Despite its incompleteness, an HDI data repository contains rich knowledge and patterns concerning the complex interactions among numerous nodes. Recently, a Neural Network (NN)-based approach to Latent Feature Analysis (LFA) model becomes popular owing to its strong representation learning ability to HDI data. Nevertheless, existing NN-based LFA models neglect the inherent nonnegativity in most HDI data, resulting in representation accuracy loss. Motivated by this discovery, this study innovatively proposes a <bold> F</bold>ast <bold> N</bold>onnegative <bold> A</bold>uto<bold> E</bold>ncoder (FNAE)-based approach to LFA on HDI data, whose ideas are three-fold: a) constructing a multilayered autoencoder subject to nonnegativity constraints for high representation learning ability; b) incorporating the data density-oriented modeling mechanism into FNAE's input and output layers for high computational and storage efficiency; and c) implementing an Adam-based single latent factor-dependent, nonnegative and multiplicative update algorithm for efficient model training as well as fulfilling the nonnegativity constraints. Experimental results on eight commonly-adopted HDI matrices from industrial applications demonstrate that the proposed FNAE significantly outperforms several state-of-the-art NN-based LFA models in both estimation accuracy for missing links of an HDI matrix and computational efficiency.
引用
收藏
页码:733 / 746
页数:14
相关论文
共 50 条
  • [1] Variational autoencoder-based outlier detection for high-dimensional data
    Li, Yongmou
    Wang, Yijie
    Ma, Xingkong
    [J]. INTELLIGENT DATA ANALYSIS, 2019, 23 (05) : 991 - 1002
  • [2] MMLF: Multi-Metric Latent Feature Analysis for High-Dimensional and Incomplete Data
    Wu, Di
    Zhang, Peng
    He, Yi
    Luo, Xin
    [J]. IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (02) : 575 - 588
  • [3] A novel autoencoder approach to feature extraction with linear separability for high-dimensional data
    Zheng, Jian
    Qu, Hongchun
    Li, Zhaoni
    Li, Lin
    Tang, Xiaoming
    Guo, Fei
    [J]. PEERJ COMPUTER SCIENCE, 2022, 8
  • [4] A novel autoencoder approach to feature extraction with linear separability for high-dimensional data
    Zheng, Jian
    Qu, Hongchun
    Li, Zhaoni
    Li, Lin
    Tang, Xiaoming
    Guo, Fei
    [J]. PeerJ Computer Science, 2022, 8
  • [5] Variational Autoencoder-Based Dimensionality Reduction for High-Dimensional Small-Sample Data Classification
    Mahmud, Mohammad Sultan
    Huang, Joshua Zhexue
    Fu, Xianghua
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2020, 19 (01)
  • [6] An Outlier-Resilient Autoencoder for Representing High-Dimensional and Incomplete Data
    Wu, Di
    Hu, Yuanpeng
    Liu, Kechen
    Li, Jing
    Wang, Xianmin
    Deng, Song
    Zheng, Nenggan
    Luo, Xin
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [7] Autoencoder-based outlier detection for sparse, high dimensional data
    Chen, Wanghu
    Li, Huijun
    Li, Jing
    Arshad, Ali
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 2735 - 2742
  • [8] Feature Selection and Classification for High-Dimensional Incomplete Multimodal Data
    Deng, Wan-Yu
    Liu, Dan
    Dong, Ying-Ying
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2018, 2018
  • [9] MMA: Multi-Metric-Autoencoder for Analyzing High-Dimensional and Incomplete Data
    Liang, Cheng
    Wu, Di
    He, Yi
    Huang, Teng
    Chen, Zhong
    Luo, Xin
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT V, 2023, 14173 : 3 - 19
  • [10] Latent Feature Group Learning for High-Dimensional Data Clustering
    Wang, Wenting
    He, Yulin
    Ma, Liheng
    Huang, Joshua Zhexue
    [J]. INFORMATION, 2019, 10 (06):