Product failure prediction with missing data

被引:19
|
作者
Kang, Seokho [1 ]
Kim, Eunji [2 ,3 ]
Shim, Jaewoong [2 ,3 ]
Chang, Wonsang [4 ]
Cho, Sungzoon [2 ,3 ]
机构
[1] Sungkyunkwan Univ, Dept Syst Management Engn, Suwon, South Korea
[2] Seoul Natl Univ, Dept Ind Engn, Seoul, South Korea
[3] Seoul Natl Univ, Inst Ind Syst Innovat, Seoul, South Korea
[4] Samsung Elect Co Ltd, Global Technol Ctr, Suwon, South Korea
基金
新加坡国家研究基金会;
关键词
data mining; predictive modelling; failure prediction; production data; missing value; NEURAL-NETWORKS; FAULT-DETECTION; DATA IMPUTATION; ROC CURVE; VALUES; QUALITY; AREA; MAP;
D O I
10.1080/00207543.2017.1407883
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In production data, missing values commonly appear for several reasons including changes in measurement and inspection items, sampling inspections, and unexpected process events. When applied to product failure prediction, the incompleteness of data should be properly addressed to avoid performance degradation in prediction models. Well-known approaches for missing data treatment, such as elimination and imputation, would not perform well under usual scenarios in production data, including high missing rate, systematic missing and class imbalance. To address these limitations, here we present a method for predictive modelling with missing data by considering the characteristics of production data. It builds multiple prediction models on different complete data subsets derived from the original data-set, each of which has different coverage of instances and input variables. These models are selectively used to make predictions for new instances with missing values. We demonstrate the effectiveness of the proposed method through a case study using actual data-sets from a home appliance manufacturer.
引用
收藏
页码:4849 / 4859
页数:11
相关论文
共 50 条
  • [41] Failure data analytics to build failure prediction mechanisms
    Dey, Sandipan
    Jacob, Kandathil Koshy
    Lopez, Javier Alonso
    Trivedi, Kishor
    [J]. 2013 IEEE INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS (ISSREW), 2013, : 102 - +
  • [42] Workload Failure Prediction for Data Centers
    Li, Jie
    Wang, Rui
    Ali, Ghazanfar
    Dang, Tommy
    Sill, Alan
    Chen, Yong
    [J]. 2023 IEEE 16TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, CLOUD, 2023, : 479 - 485
  • [43] An alternative data filling approach for prediction of missing data in soft sets (ADFIS)
    Khan, Muhammad Sadiq
    Al-Garadi, Mohammed Ali
    Wahab, Ainuddin Wahid Abdul
    Herawan, Tutut
    [J]. SPRINGERPLUS, 2016, 5
  • [44] A product-multinomial framework for categorical data analysis with missing responses
    Poleto, Frederico Z.
    Singer, Julio M.
    Paulino, Carlos Daniel
    [J]. BRAZILIAN JOURNAL OF PROBABILITY AND STATISTICS, 2014, 28 (01) : 109 - 139
  • [45] A Proportional Intensity Model with Frailty for Missing Recurrent Failure Data
    Bae, Suk Joo
    Mun, Byeong Min
    Zhu, Xiaoyan
    [J]. TECHNOMETRICS, 2024, 66 (02) : 253 - 266
  • [46] Toward the Imputation and Prediction of Condition Monitoring Data with Missing Values
    Zhang, Di
    Li, Canbing
    Zhu, Jizhong
    [J]. 2023 IEEE/IAS INDUSTRIAL AND COMMERCIAL POWER SYSTEM ASIA, I&CPS ASIA, 2023, : 996 - 1002
  • [47] Finite sample prediction and interpolation for ARIMA models with missing data
    Penzer, J
    Shea, B
    [J]. JOURNAL OF FORECASTING, 1999, 18 (06) : 411 - 419
  • [48] Model Selection to Enhance Prediction Performance in the Presence of Missing Data
    Bashir, Faraj
    Wei, Hua-Liang
    Benomair, Abdollha
    [J]. 2015 20TH INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR), 2015, : 846 - 850
  • [49] Hybrid prediction model with missing value imputation for medical data
    Purwar, Archana
    Singh, Sandeep Kumar
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (13) : 5621 - 5631
  • [50] Optimization of sand casting performance parameters and missing data prediction
    Xu, Qingwei
    Xu, Kaili
    Li, Li
    Yao, Xiwen
    [J]. ROYAL SOCIETY OPEN SCIENCE, 2019, 6 (08):