Feature selection for quality prediction under distribution shift

被引:0
|
作者
Liu, Wenyi [1 ]
Yairi, Takehisa [1 ]
Tamai, Nana [2 ]
机构
[1] Univ Tokyo, Dept Adv Interdisciplinary Studies, Tokyo, Japan
[2] ENEOS, Data Sci Grp, Cent Tech Res Lab, Innovat Technol Ctr, Yokohama, Kanagawa, Japan
关键词
Quality prediction; Feature selection; Relief; Distribution shift;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Distribution shifts due to the system and the external reasons in industrial processes are very common, which devastates the predictions of the linear model easily. This paper provides a practical solution for quality prediction under the circumstance of constant data shifts. Through analyzing a real-world petroleum plant data set, we show that via keeping the linear model as simple and relevant to the task as possible, the impacts of the data shift on the linear model can be diminished, under the assumption that the relationships between the target variable and the explanatory variables maintain in a similar state. In particular, we propose a pragmatic procedure for feature selection, including dealing with redundant features, considering interactions and enlarging the feature space. Extensive experiments demonstrate the effectiveness of this method, and comprehensive analysis of the results confirms and supports this finding.
引用
收藏
页码:548 / 552
页数:5
相关论文
共 50 条
  • [1] THE USE OF UNDER- AND OVERSAMPLING WITHIN ENSEMBLE FEATURE SELECTION AND CLASSIFICATION FOR SOFTWARE QUALITY PREDICTION
    Gao, Kehan
    Khoshgoftaar, Taghi M.
    Wald, Randall
    [J]. INTERNATIONAL JOURNAL OF RELIABILITY QUALITY AND SAFETY ENGINEERING, 2014, 21 (01)
  • [2] Feature selection for label distribution learning under feature weight view
    Lin, Shidong
    Wang, Chenxi
    Mao, Yu
    Lin, Yaojin
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (05) : 1827 - 1840
  • [3] Feature selection for label distribution learning under feature weight view
    Shidong Lin
    Chenxi Wang
    Yu Mao
    Yaojin Lin
    [J]. International Journal of Machine Learning and Cybernetics, 2024, 15 : 1827 - 1840
  • [4] A Systematic Review of Feature Selection Techniques in Software Quality Prediction
    Alsolai, Hadeel
    Roper, Marc
    [J]. 2019 INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTING TECHNOLOGIES AND APPLICATIONS (ICECTA), 2019,
  • [5] Feature selection for data driven prediction of protein model quality
    Montuori, Alfonso
    Pugliese, Luisa
    Raimondo, Giovanni
    Pasero, Eros
    [J]. 2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 3561 - +
  • [6] Prediction with Incomplete Data under Agnostic Mask Distribution Shift
    Zhu, Yichen
    Yuan, Jian
    Jiang, Bo
    Lin, Tao
    Jin, Haiming
    Wang, Xinbing
    Zhou, Chenghu
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4720 - 4728
  • [7] On Prior Selection and Covariate Shift of -Bayesian Prediction Under -Divergence Risk
    Suzuki, Taiji
    Komaki, Fumiyasu
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2010, 39 (8-9) : 1655 - 1673
  • [8] Improving the quality of enzyme prediction by using feature selection and dimensionality reduction
    Brito, Luis H.
    Lara, Ana L. C. V.
    Zarate, Luis E.
    Nobre, Cristiane N.
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [9] Air Quality Prediction Based on Kohonen Clustering and ReliefF Feature Selection
    Chen, Bolun
    Zhu, Guochang
    Ji, Min
    Yu, Yongtao
    Zhao, Jianyang
    Liu, Wei
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 64 (02): : 1039 - 1049
  • [10] Deep Feature Selection Framework for Quality Prediction in Injection Molding Process
    Kim, Iljeok
    Na, Juwon
    Yun, Jong Pil
    Lee, Seungchul
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (01) : 503 - 512