WrapperRL: Reinforcement Learning Agent for Feature Selection in High-Dimensional Industrial Data

被引:0
|
作者
Shaer, Ibrahim [1 ]
Shami, Abdallah [1 ]
机构
[1] Univ Western Ontario, Dept Elect & Comp Engn, London, ON N6A 3K7, Canada
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Noise measurement; Data models; Feature extraction; Time-frequency analysis; Streaming media; Object recognition; Standards; Deep learning; Anomaly detection; Reinforcement learning; Closed box; CNN model interpretability; anomaly detection; manufacturing noise;
D O I
10.1109/ACCESS.2024.3456688
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Finding the set of discriminatory features in a classification task is imperative for the interpretability of the "black box" deep learning (DL) models, especially in high-stakes industrial applications such as predictive maintenance and industrial noise classification. In cases with time-series Time-Frequency (TF) domain data, the interpretability of DL models is challenged by the data's high dimensionality and the need to maintain the characteristics of the original signal when interpreting classification results. This paper devises a three-stage process that supports the interpretability of a DL model identifying industrial noise through a forward feature selection procedure. The first stage transforms the original TF data into an image representation. The second stage proposes a 2D Convolutional Neural Network (CNN) with a self-attention mechanism (SA-CNN) that classifies the data into instances with and without industrial noise. The final stage, termed WrapperRL, utilizes a Reinforcement Learning (RL) agent, to find the set of discriminatory frequency bands contributing to classification results. SA-CNN and WrapperRL both outperform the state-of-the-art implementations, each in their own specialty. The insights provided by WrapperRL suggest the contribution of around 20% of frequency bands to the existence of industrial noise, mainly residing in the low-frequency domain. Together, both of these approaches serve as a promising starting point for enhancing the interpretability of DL models and explaining the classification results of industrial TF data.
引用
收藏
页码:128338 / 128348
页数:11
相关论文
共 50 条
  • [41] Scalable Feature Selection in High-Dimensional Data Based on GRASP
    Moshki, Mohsen
    Kabiri, Peyman
    Mohebalhojeh, Alireza
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2015, 29 (03) : 283 - 296
  • [42] Joint image clustering and feature selection with auto-adjoined learning for high-dimensional data
    Wang, Xiaodong
    Wu, Pengtao
    Xu, Qinghua
    Zeng, Zhiqiang
    Xie, Yong
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 232
  • [43] A novel feature learning framework for high-dimensional data classification
    Yanxia Li
    Yi Chai
    Hongpeng Yin
    Bo Chen
    [J]. International Journal of Machine Learning and Cybernetics, 2021, 12 : 555 - 569
  • [44] A novel feature learning framework for high-dimensional data classification
    Li, Yanxia
    Chai, Yi
    Yin, Hongpeng
    Chen, Bo
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (02) : 555 - 569
  • [45] Latent Feature Group Learning for High-Dimensional Data Clustering
    Wang, Wenting
    He, Yulin
    Ma, Liheng
    Huang, Joshua Zhexue
    [J]. INFORMATION, 2019, 10 (06):
  • [46] Bird's Eye View feature selection for high-dimensional data
    Belhaouari, Samir Brahim
    Shakeel, Mohammed Bilal
    Erbad, Aiman
    Oflaz, Zarina
    Kassoul, Khelil
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [47] Feature selection using autoencoders with Bayesian methods to high-dimensional data
    Shu, Lei
    Huang, Kun
    Jiang, Wenhao
    Wu, Wenming
    Liu, Hongling
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (06) : 7397 - 7406
  • [48] The feature selection bias problem in relation to high-dimensional gene data
    Krawczuk, Jerzy
    Lukaszuk, Tomasz
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2016, 66 : 63 - 71
  • [49] Improving Evolutionary Algorithm Performance for Feature Selection in High-Dimensional Data
    Cilia, N.
    De Stefano, C.
    Fontanella, F.
    di Freca, A. Scotto
    [J]. APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2018, 2018, 10784 : 439 - 454
  • [50] Feature Selection for High-Dimensional Data Through Instance Vote Combining
    Chamakura, Lily
    Saha, Goutam
    [J]. PROCEEDINGS OF THE 7TH ACM IKDD CODS AND 25TH COMAD (CODS-COMAD 2020), 2020, : 161 - 169