Product failure prediction with missing data

被引:19
|
作者
Kang, Seokho [1 ]
Kim, Eunji [2 ,3 ]
Shim, Jaewoong [2 ,3 ]
Chang, Wonsang [4 ]
Cho, Sungzoon [2 ,3 ]
机构
[1] Sungkyunkwan Univ, Dept Syst Management Engn, Suwon, South Korea
[2] Seoul Natl Univ, Dept Ind Engn, Seoul, South Korea
[3] Seoul Natl Univ, Inst Ind Syst Innovat, Seoul, South Korea
[4] Samsung Elect Co Ltd, Global Technol Ctr, Suwon, South Korea
基金
新加坡国家研究基金会;
关键词
data mining; predictive modelling; failure prediction; production data; missing value; NEURAL-NETWORKS; FAULT-DETECTION; DATA IMPUTATION; ROC CURVE; VALUES; QUALITY; AREA; MAP;
D O I
10.1080/00207543.2017.1407883
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In production data, missing values commonly appear for several reasons including changes in measurement and inspection items, sampling inspections, and unexpected process events. When applied to product failure prediction, the incompleteness of data should be properly addressed to avoid performance degradation in prediction models. Well-known approaches for missing data treatment, such as elimination and imputation, would not perform well under usual scenarios in production data, including high missing rate, systematic missing and class imbalance. To address these limitations, here we present a method for predictive modelling with missing data by considering the characteristics of production data. It builds multiple prediction models on different complete data subsets derived from the original data-set, each of which has different coverage of instances and input variables. These models are selectively used to make predictions for new instances with missing values. We demonstrate the effectiveness of the proposed method through a case study using actual data-sets from a home appliance manufacturer.
引用
收藏
页码:4849 / 4859
页数:11
相关论文
共 50 条
  • [31] A diagnostic for assessing the influence of cases on the prediction of missing data
    Cavanaugh, JE
    Oleson, JJ
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 2001, 50 : 427 - 440
  • [32] Prediction and Inference With Missing Data in Patient Alert Systems
    Storlie, Curtis B.
    Therneau, Terry M.
    Carter, Rickey E.
    Chia, Nicholas
    Bergquist, John R.
    Huddleston, Jeanne M.
    Romero-Brufau, Santiago
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (529) : 32 - 46
  • [33] Development of missing data prediction model for carbon monoxide
    Abd Rani, Nurul Latiffah
    Azid, Azman
    Sani, Muhamad Shirwan Abdullah
    Samsudin, Mohd Saiful
    Yusof, Ku Mohd Kalkausar Ku
    Amin, Siti Noor Syuhada Muhammad
    Khalit, Saiful Iskandar
    [J]. MALAYSIAN JOURNAL OF FUNDAMENTAL AND APPLIED SCIENCES, 2019, 15 (01): : 13 - 17
  • [34] Missing Data and ICU Mortality Prediction: Gone But Not to Be Forgotten*
    Nagrebetsky, Alexander
    Bittner, Edward A.
    [J]. CRITICAL CARE MEDICINE, 2017, 45 (12) : 2108 - 2109
  • [35] Prediction of Crowd Flow in City Complex with Missing Data
    Qiu, Shiyang
    Xu, Peng
    Zheng, Wei
    Wang, Junjie
    Yu, Guo
    Hou, Mingyao
    Liu, Hengchang
    [J]. ASIAN CONFERENCE ON MACHINE LEARNING, VOL 101, 2019, 101 : 758 - 773
  • [36] Uncertainty -aware Traffic Prediction under Missing Data
    Mei, Hao
    Li, Junxian
    Liang, Zhiming
    Zheng, Guanjie
    Shi, Bin
    Wei, Hua
    [J]. 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023, 2023, : 1223 - 1228
  • [37] Attempts Prediction by Missing Data Imputation in Engineering Degree
    Jove, Esteban
    Blanco-Rodriguez, Patricia
    Luis Casteleiro-Roca, Jose
    Moreno-Arboleda, Javier
    Antonio Lopez-Vazquez, Jose
    de Cos Juez, Francisco Javier
    Luis Calvo-Rolle, Jose
    [J]. INTERNATIONAL JOINT CONFERENCE SOCO'17- CISIS'17-ICEUTE'17 PROCEEDINGS, 2018, 649 : 167 - 176
  • [38] Traffic Speed Prediction with Missing Data based on TGCN
    Ge, Liang
    Li, Hang
    Liu, Junling
    Zhou, Aoli
    [J]. 2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, : 522 - 529
  • [39] PRODUCT DATA PREDICTION WITH UNCERTAINTY IN PRODUCT LIFE CYCLE DESIGN
    Yu Suiran Wang ChengtaoSchool of Mechanical Engineering
    [J]. Chinese Journal of Mechanical Engineering, 2003, (03) : 296 - 299
  • [40] A cautionary note on the use of the missing indicator method for handling missing data in prediction research
    van Smeden, Maarten
    Groenwold, Rolf H. H.
    Moons, Karel GM.
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2020, 125 : 188 - 190