WrapperRL: Reinforcement Learning Agent for Feature Selection in High-Dimensional Industrial Data

被引:0
|
作者
Shaer, Ibrahim [1 ]
Shami, Abdallah [1 ]
机构
[1] Univ Western Ontario, Dept Elect & Comp Engn, London, ON N6A 3K7, Canada
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Noise measurement; Data models; Feature extraction; Time-frequency analysis; Streaming media; Object recognition; Standards; Deep learning; Anomaly detection; Reinforcement learning; Closed box; CNN model interpretability; anomaly detection; manufacturing noise;
D O I
10.1109/ACCESS.2024.3456688
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Finding the set of discriminatory features in a classification task is imperative for the interpretability of the "black box" deep learning (DL) models, especially in high-stakes industrial applications such as predictive maintenance and industrial noise classification. In cases with time-series Time-Frequency (TF) domain data, the interpretability of DL models is challenged by the data's high dimensionality and the need to maintain the characteristics of the original signal when interpreting classification results. This paper devises a three-stage process that supports the interpretability of a DL model identifying industrial noise through a forward feature selection procedure. The first stage transforms the original TF data into an image representation. The second stage proposes a 2D Convolutional Neural Network (CNN) with a self-attention mechanism (SA-CNN) that classifies the data into instances with and without industrial noise. The final stage, termed WrapperRL, utilizes a Reinforcement Learning (RL) agent, to find the set of discriminatory frequency bands contributing to classification results. SA-CNN and WrapperRL both outperform the state-of-the-art implementations, each in their own specialty. The insights provided by WrapperRL suggest the contribution of around 20% of frequency bands to the existence of industrial noise, mainly residing in the low-frequency domain. Together, both of these approaches serve as a promising starting point for enhancing the interpretability of DL models and explaining the classification results of industrial TF data.
引用
收藏
页码:128338 / 128348
页数:11
相关论文
共 50 条
  • [1] Feature Selection and Feature Learning for High-dimensional Batch Reinforcement Learning: A Survey
    De-Rong Liu
    Hong-Liang Li
    Ding Wang
    [J]. International Journal of Automation and Computing, 2015, 12 (03) : 229 - 242
  • [2] Feature Selection and Feature Learning for High-dimensional Batch Reinforcement Learning: A Survey
    Liu, De-Rong
    Li, Hong-Liang
    Wang, Ding
    [J]. INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2015, 12 (03) : 229 - 242
  • [3] Feature selection for high-dimensional data
    Destrero A.
    Mosci S.
    De Mol C.
    Verri A.
    Odone F.
    [J]. Computational Management Science, 2009, 6 (1) : 25 - 40
  • [4] Feature selection for high-dimensional data
    Bolón-Canedo V.
    Sánchez-Maroño N.
    Alonso-Betanzos A.
    [J]. Progress in Artificial Intelligence, 2016, 5 (2) : 65 - 75
  • [5] FEATURE SELECTION FOR HIGH-DIMENSIONAL DATA ANALYSIS
    Verleysen, Michel
    [J]. NCTA 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON NEURAL COMPUTATION THEORY AND APPLICATIONS, 2011, : IS23 - IS25
  • [6] Feature selection for high-dimensional data in astronomy
    Zheng, Hongwen
    Zhang, Yanxia
    [J]. ADVANCES IN SPACE RESEARCH, 2008, 41 (12) : 1960 - 1964
  • [7] Feature selection for high-dimensional imbalanced data
    Yin, Liuzhi
    Ge, Yong
    Xiao, Keli
    Wang, Xuehua
    Quan, Xiaojun
    [J]. NEUROCOMPUTING, 2013, 105 : 3 - 11
  • [8] A filter feature selection for high-dimensional data
    Janane, Fatima Zahra
    Ouaderhman, Tayeb
    Chamlal, Hasna
    [J]. JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2023, 17
  • [9] Feature Selection with High-Dimensional Imbalanced Data
    Van Hulse, Jason
    Khoshgoftaar, Taghi M.
    Napolitano, Amri
    Wald, Randall
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 507 - 514
  • [10] Feature selection for high-dimensional temporal data
    Michail Tsagris
    Vincenzo Lagani
    Ioannis Tsamardinos
    [J]. BMC Bioinformatics, 19