Hybrid Feature Selection for High-Dimensional Manufacturing Data

Cited by: 2
Authors
Sun, Yajuan [1]
Yu, Jianlin [2]
Li, Xiang [1]
Wu, Ji Yan [2]
Lu, Wen Feng [2]
Affiliations
[1] A*STAR Singapore Inst Mfg Technol, 2 Fusionopolis Way, 08-04 Innovis, Singapore 138634, Singapore
[2] Natl Univ Singapore, Dept Mech Engn, Block EA, Singapore 117575, Singapore
Keywords
Feature selection; Wrapper method; High-dimensional manufacturing data; Mutual information; ReliefF
DOI
10.1109/ETFA45728.2021.9613547
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In a manufacturing environment, hundreds of input parameters are related to product quality. To build an accurate machine learning model for quality prediction, it is necessary to identify the major input parameters that strongly influence the prediction. The procedure of identifying these major factors among the original high-dimensional input parameters is called feature selection. This paper proposes a hybrid feature selection method that effectively reduces the search space by leveraging the feature subsets chosen by the Fast Correlation-Based Filter (FCBF) and Relief-based feature selection. Its computational complexity is proved to be quadratic in the number of features, whereas most existing methods suffer from exponential computational complexity. This improvement is crucial for high-dimensional input parameters because it dramatically reduces computation time. Furthermore, the proposed method also outperforms the benchmark method in prediction accuracy. These results are demonstrated by applying the method to real-world manufacturing data sets and an open-source benchmark data set.
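The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general filter-then-wrapper idea it describes, not the authors' method. It assumes an absolute Pearson correlation ranking as a stand-in for the FCBF and Relief-based filters, and a nearest-centroid classifier scored on the training data as a stand-in for the wrapper's evaluator. All function names and the synthetic data are hypothetical. The point it illustrates is that a greedy forward search over a filtered pool of k candidates needs at most k(k+1)/2 subset evaluations, whereas an exhaustive search would need 2^k - 1.

```python
import random
from statistics import mean


def abs_pearson(xs, ys):
    """Absolute Pearson correlation; an assumed stand-in for FCBF/Relief scores."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return abs(cov) / (vx * vy) ** 0.5


def filter_stage(X, y, keep):
    """Rank all features by the filter score and keep the top `keep` indices."""
    n_features = len(X[0])
    scores = [abs_pearson([row[j] for row in X], y) for j in range(n_features)]
    ranked = sorted(range(n_features), key=lambda j: scores[j], reverse=True)
    return ranked[:keep]


def centroid_accuracy(X, y, subset):
    """Training accuracy of a nearest-centroid classifier on the given features."""
    classes = sorted(set(y))
    centroids = {}
    for c in classes:
        rows = [r for r, label in zip(X, y) if label == c]
        centroids[c] = [mean(r[j] for r in rows) for j in subset]
    correct = 0
    for r, label in zip(X, y):
        pred = min(
            classes,
            key=lambda c: sum((r[j] - m) ** 2 for j, m in zip(subset, centroids[c])),
        )
        correct += pred == label
    return correct / len(y)


def forward_wrapper(X, y, candidates):
    """Greedy forward selection restricted to the filtered candidate pool.

    Each round evaluates every remaining candidate once, so the total number
    of evaluations is at most k + (k - 1) + ... + 1 = k(k + 1) / 2.
    """
    selected, best, evals = [], 0.0, 0
    remaining = list(candidates)
    while remaining:
        scored = []
        for j in remaining:
            scored.append((centroid_accuracy(X, y, selected + [j]), j))
            evals += 1
        top_score, top_j = max(scored, key=lambda t: t[0])
        if top_score <= best:  # stop when no candidate improves the score
            break
        selected.append(top_j)
        remaining.remove(top_j)
        best = top_score
    return selected, best, evals


# Synthetic data (hypothetical): feature 0 carries the label, the rest is noise.
random.seed(0)
y = [i % 2 for i in range(100)]
X = [[10 * label + random.gauss(0, 1)] + [random.gauss(0, 1) for _ in range(19)]
     for label in y]

candidates = filter_stage(X, y, keep=5)
selected, best_score, n_evals = forward_wrapper(X, y, candidates)
print("candidates:", candidates)
print("selected:", selected, "score:", best_score, "evaluations:", n_evals)
```

In this sketch the filter stage costs one score per feature, and the wrapper only searches the surviving pool, which is where the reduction in search space comes from. Swapping in a real FCBF or ReliefF implementation and a cross-validated model would change the scores but not the evaluation-count bound.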
Pages: 6