Impacts of Feature Selection on Predicting Machine Failures by Machine Learning Algorithms

被引:4
|
作者
Bezerra, Francisco Elanio [1 ]
de Oliveira Neto, Geraldo Cardoso [2 ]
Cervi, Gabriel Magalhaes [3 ]
Mazetto, Rafaella Francesconi [3 ]
de Faria, Aline Mariane [3 ]
Vido, Marcos [4 ]
Lima, Gustavo Araujo [5 ]
de Araujo, Sidnei Alves [5 ]
Sampaio, Mauro [6 ]
Amorim, Marlene [7 ]
机构
[1] Univ Sao Paulo, Polytech Sch, Dept Energy Engn & Elect Automat, 158 Prof Luciano Gualberto Ave, BR-05508010 Sao Paulo, Brazil
[2] Fed Univ ABC, Alameda Univ, Ind Engn Post Grad Program, S-n Bairro Anchieta, BR-09606045 Sao Bernardo Do Campo, SP, Brazil
[3] FEI Univ, Business Adm Postgrad Program, Tamandare St 688,5 Floor, BR-01525000 Sao Paulo, Brazil
[4] Nove de Julho Univ UNINOVE, Ind Engn Postgrad Program, Vergueiro St 235-249, BR-01504001 Sao Paulo, Brazil
[5] Nove de Julho Univ UNINOVE, Informat & Knowledge Management Postgrad Program, Vergueiro St 235-249, BR-01504001 Sao Paulo, Brazil
[6] FEI Univ, Ind Engn Postgrad Program, Ave Humberto Alencar Castelo Branco 3972-B, BR-09850901 Sao Bernardo Do Campo, Brazil
[7] Univ Aveiro, GOVCOPP DEGEIT, P-3810193 Aveiro, Portugal
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 08期
关键词
machine learning; machine failure; feature selection; predictive maintenance; sensor selection; CLASSIFICATION; NETWORKS;
D O I
10.3390/app14083337
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In the context of Industry 4.0, managing large amounts of data is essential to ensure informed decision-making in intelligent production environments. It enables, for example, predictive maintenance, which is essential for anticipating and identifying causes of failures in machines and equipment, optimizing processes, and promoting proactive management of human, financial, and material resources. However, generating accurate information for decision-making requires adopting suitable data preprocessing and analysis techniques. This study explores the identification of machine failures based on synthetic industrial data. Initially, we applied the feature selection techniques Principal Component Analysis (PCA), Minimum Redundancy Maximum Relevance (mRMR), Neighborhood Component Analysis (NCA), and Denoising Autoencoder (DAE) to the collected data and compared their results. In the sequence, a comparison among three widely known machine learning classifiers, namely Random Forest (RF), Support Vector Machine (SVM), and Multilayer Perceptron neural network (MLP), was conducted, with and without considering feature selection. The results showed that PCA and RF were superior to the other techniques, allowing the classification of failures with rates of 0.98, 0.97, and 0.98 for the accuracy, precision, and recall metrics, respectively. Thus, this work contributes by solving an industrial problem and detailing techniques to identify the most relevant variables and machine learning algorithms for predicting machine failures that negatively impact production planning. The findings provided by this study can assist industries in giving preference to employing sensors and collecting data that can contribute more effectively to machine failure predictions.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Feature Selection and Intrusion Detection in Cloud Environment based on Machine Learning Algorithms
    Javadpour, Amir
    Abharian, Sanaz Kazemi
    Wang, Guojun
    2017 15TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS AND 2017 16TH IEEE INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS (ISPA/IUCC 2017), 2017, : 1417 - 1421
  • [22] Investigation of machine learning algorithms on heart disease through dominant feature detection and feature selection
    Fuat Türk
    Signal, Image and Video Processing, 2024, 18 : 3943 - 3955
  • [23] Predicting cardiovascular disease by combining optimal feature selection methods with machine learning
    Rodriguez Segura, Mauricio
    Nicolis, Orietta
    Peralta Marquez, Billy
    Carrillo Azocar, Juan
    2020 39TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2020,
  • [24] Investigation of machine learning algorithms on heart disease through dominant feature detection and feature selection
    Turk, Fuat
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (04) : 3943 - 3955
  • [25] Machine Learning Feature Selection for Predicting High Concentration Therapeutic Antibody Aggregation
    Lai, Pin-Kuang
    Fernando, Amendra
    Cloutier, Theresa K.
    Kingsbury, Jonathan S.
    Gokarn, Yatin
    Halloran, Kevin T.
    Calero-Rubio, Cesar
    Trout, Bernhardt L.
    JOURNAL OF PHARMACEUTICAL SCIENCES, 2021, 110 (04) : 1583 - 1591
  • [26] Score and Correlation Coefficient-Based Feature Selection for Predicting Heart Failure Diagnosis by Using Machine Learning Algorithms
    Senan, Ebrahim Mohammed
    Abunadi, Ibrahim
    Jadhav, Mukti E.
    Fati, Suliman Mohamed
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2021, 2021
  • [27] Machine learning algorithms for predicting scapular kinematics
    Nicholson, Kristen F.
    Richardson, R. Tyler
    van Roden, Elizabeth A. Rapp
    Quinton, R. Garry
    Anzilotti, Kert F.
    Richards, James G.
    MEDICAL ENGINEERING & PHYSICS, 2019, 65 : 39 - 45
  • [28] Predicting property prices with machine learning algorithms
    Ho, Winky K. O.
    Tang, Bo-Sin
    Wong, Siu Wai
    JOURNAL OF PROPERTY RESEARCH, 2021, 38 (01) : 48 - 70
  • [29] Machine learning algorithms for predicting rainfall in India
    Garai, Sandi
    Paul, Ranjit Kumar
    Yeasin, Md.
    Roy, H. S.
    Paul, A. K.
    CURRENT SCIENCE, 2024, 126 (03): : 360 - 367
  • [30] A Comprehensive Review of Feature Selection and Feature Selection Stability in Machine Learning
    Buyukkececi, Mustafa
    Okur, Mehmet Cudi
    GAZI UNIVERSITY JOURNAL OF SCIENCE, 2023, 36 (04): : 1506 - 1520