Product failure prediction with missing data

被引:19
|
作者
Kang, Seokho [1 ]
Kim, Eunji [2 ,3 ]
Shim, Jaewoong [2 ,3 ]
Chang, Wonsang [4 ]
Cho, Sungzoon [2 ,3 ]
机构
[1] Sungkyunkwan Univ, Dept Syst Management Engn, Suwon, South Korea
[2] Seoul Natl Univ, Dept Ind Engn, Seoul, South Korea
[3] Seoul Natl Univ, Inst Ind Syst Innovat, Seoul, South Korea
[4] Samsung Elect Co Ltd, Global Technol Ctr, Suwon, South Korea
基金
新加坡国家研究基金会;
关键词
data mining; predictive modelling; failure prediction; production data; missing value; NEURAL-NETWORKS; FAULT-DETECTION; DATA IMPUTATION; ROC CURVE; VALUES; QUALITY; AREA; MAP;
D O I
10.1080/00207543.2017.1407883
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In production data, missing values commonly appear for several reasons including changes in measurement and inspection items, sampling inspections, and unexpected process events. When applied to product failure prediction, the incompleteness of data should be properly addressed to avoid performance degradation in prediction models. Well-known approaches for missing data treatment, such as elimination and imputation, would not perform well under usual scenarios in production data, including high missing rate, systematic missing and class imbalance. To address these limitations, here we present a method for predictive modelling with missing data by considering the characteristics of production data. It builds multiple prediction models on different complete data subsets derived from the original data-set, each of which has different coverage of instances and input variables. These models are selectively used to make predictions for new instances with missing values. We demonstrate the effectiveness of the proposed method through a case study using actual data-sets from a home appliance manufacturer.
引用
收藏
页码:4849 / 4859
页数:11
相关论文
共 50 条
  • [1] Product failure prediction with missing data using graph neural networks
    Kang, Seokho
    [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (12): : 7225 - 7234
  • [2] Product failure prediction with missing data using graph neural networks
    Seokho Kang
    [J]. Neural Computing and Applications, 2021, 33 : 7225 - 7234
  • [3] Data quality assessment in product failure prediction models
    Ferencek, Aljaz
    Kljajic Borstnar, Mirjana
    [J]. JOURNAL OF DECISION SYSTEMS, 2020, 29 (sup1) : 79 - 86
  • [4] An Empirical Investigation of Missing Data Handling in Cloud Node Failure Prediction
    Ma, Minghua
    Liu, Yudong
    Tong, Yuang
    Li, Haozhe
    Zhao, Pu
    Xu, Yong
    Zhang, Hongyu
    He, Shilin
    Wang, Lu
    Dang, Yingnong
    Rajmohan, Saravanakumar
    Lin, Qingwei
    [J]. PROCEEDINGS OF THE 30TH ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2022, 2022, : 1453 - 1464
  • [5] A new analytical framework for missing data imputation and classification with uncertainty: Missing data imputation and heart failure readmission prediction
    Hu, Zhiyong
    Du, Dongping
    [J]. PLOS ONE, 2020, 15 (09):
  • [6] Challenges of Machine Failure Prediction with Product Data - A Case Study
    Buhl, Dominik
    Lanquillon, Carsten
    [J]. ARTIFICIAL INTELLIGENCE IN HCI, PT II, AI-HCI 2024, 2024, 14735 : 308 - 322
  • [7] Semiparametric regression and risk prediction with competing risks data under missing cause of failure
    Bakoyannis, Giorgos
    Zhang, Ying
    Yiannoutsos, Constantin T.
    [J]. LIFETIME DATA ANALYSIS, 2020, 26 (04) : 659 - 684
  • [8] Semiparametric regression and risk prediction with competing risks data under missing cause of failure
    Giorgos Bakoyannis
    Ying Zhang
    Constantin T. Yiannoutsos
    [J]. Lifetime Data Analysis, 2020, 26 : 659 - 684
  • [9] Analysis of Failure Data with Missing Labels
    Cai, Jiaxiang
    Ye, Xin
    Tang, Loon Ching
    [J]. 2022 4TH INTERNATIONAL CONFERENCE ON SYSTEM RELIABILITY AND SAFETY ENGINEERING, SRSE, 2022, : 500 - 504
  • [10] IOHMM for Location Prediction with Missing Data
    Hu, Jiawei
    Wang, Yanfeng
    Zhang, Ya
    [J]. PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (IEEE DSAA 2015), 2015, : 651 - 660