Feature selection for quality prediction under distribution shift

Cited by: 0
Authors
Liu, Wenyi [1]
Yairi, Takehisa [1]
Tamai, Nana [2]
Affiliations
[1] Univ Tokyo, Dept Adv Interdisciplinary Studies, Tokyo, Japan
[2] ENEOS, Data Sci Grp, Cent Tech Res Lab, Innovat Technol Ctr, Yokohama, Kanagawa, Japan
Keywords
Quality prediction; Feature selection; Relief; Distribution shift
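DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract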
Distribution shifts caused by system-internal and external factors are very common in industrial processes and can easily and severely degrade the predictions of a linear model. This paper provides a practical solution for quality prediction under constant data shift. By analyzing a real-world petroleum plant data set, we show that keeping the linear model as simple and as relevant to the task as possible diminishes the impact of data shift on the model, under the assumption that the relationships between the target variable and the explanatory variables remain similar. In particular, we propose a pragmatic feature selection procedure that handles redundant features, considers interactions, and enlarges the feature space. Extensive experiments demonstrate the effectiveness of this method, and a comprehensive analysis of the results supports this finding.
Pages: 548-552
Page count: 5
Related papers
50 items in total
  • [41] Feature selection based on quality of information
    Liu, Jinghua
    Lin, Yaojin
    Lin, Menglei
    Wu, Shunxiang
    Zhang, Jia
    NEUROCOMPUTING, 2017, 225 : 11 - 22
  • [42] Distribution-Free Prediction Intervals Under Covariate Shift, With an Application to Causal Inference
    Qin, Jing
    Liu, Yukun
    Li, Moming
    Huang, Chiung-Yu
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024,
  • [43] Prediction of the shift of the distribution of Pinus brutia Ten. Under future climate model
    E. Seda Arslan
    Ömer K. Örücü
    Süleyman Gülcü
    Samet Dirlik
    Ecem Hoşgör
    New Forests, 2025, 56 (2)
  • [44] Feature Selection Under Fairness Constraints
    Dorleon, Ginel
    Megdiche, Imen
    Bricon-Souf, Nathalie
    Teste, Olivier
    37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2022, : 1125 - 1127
  • [45] Feature Selection Under a Complexity Constraint
    Plasberg, Jan H.
    Kleijn, W. Bastiaan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2009, 11 (03) : 565 - 571
  • [46] Conformal Prediction Under Covariate Shift
    Tibshirani, Ryan J.
    Barber, Rina Foygel
    Candes, Emmanuel J.
    Ramdas, Aaditya
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [47] Performance Prediction Under Dataset Shift
    Maggio, Simona
    Bouvier, Victor
    Dreyfus-Schmidt, Leo
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2466 - 2474
  • [48] Metaheuristic feature selection for software fault prediction
    Kumar, Kulamala Vinod
    Kumari, Priyanka
    Rao, Madhuri
    Mohapatra, Durga Prasad
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2022, 43 (05) : 1013 - 1020
  • [49] A Feature Selection Method for Prediction Essential Protein
    Zhong, Jiancheng
    Wang, Jianxin
    Peng, Wei
    Zhang, Zhen
    Li, Min
    TSINGHUA SCIENCE AND TECHNOLOGY, 2015, 20 (05) : 491 - 499
  • [50] Feature subset selection for splice site prediction
    Degroeve, S
    De Baets, B
    Van de Peer, Y
    Rouzé, P
    BIOINFORMATICS, 2002, 18 : S75 - S83