A forest-based algorithm for selecting informative variables using Variable Depth Distribution

Cited by: 4
Authors
Voronov, Sergii [1]
Jung, Daniel [1]
Frisk, Erik [1]
Affiliation
[1] Linkoping Univ, Dept Elect Engn, S-58183 Linkoping, Sweden
Keywords
Variable selection; Random Survival Forest; Random Forest; Automotive; Misfire detection; Survival
DOI
10.1016/j.engappai.2020.104073
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Predictive maintenance of technical systems and their components is a promising approach to optimize system usage and reduce downtime. Various sensor data are logged during system operation for different purposes, but they are sometimes not directly related to the degradation of a specific component. Variable selection algorithms are therefore necessary to reduce model complexity and improve the interpretability of diagnostic and prognostic algorithms. This paper presents a forest-based variable selection algorithm that measures the importance of a variable by analyzing the distribution of that variable in the decision tree structure, called the Variable Depth Distribution. The proposed algorithm is developed for datasets with correlated variables, which pose problems for existing forest-based variable selection methods. It is evaluated and analyzed using three case studies: survival analysis of lead-acid batteries in heavy-duty vehicles, engine misfire detection, and a simulated prognostics dataset. The results show the usefulness of the proposed algorithm relative to existing forest-based methods and its ability to identify important variables in different applications. As an example, the battery prognostics case study shows that similar predictive performance is achieved when only 17% of the variables are used, compared to using all measured signals.
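The idea of looking at where a variable sits in the trees of a forest can be illustrated with a small sketch. This is not the paper's algorithm: the abstract does not specify how the Variable Depth Distribution is turned into an importance score, so the `Node` class, the `depth_counts` and `mean_depth` helpers, and the use of mean split depth as a ranking criterion are all assumptions made only for illustration.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Node:
    """Hypothetical tree node; `feature` is None for a leaf."""
    feature: Optional[int] = None
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def depth_counts(forest: List[Node]) -> Dict[int, Counter]:
    """For each split variable, count how often it appears at each depth."""
    counts: Dict[int, Counter] = defaultdict(Counter)
    for root in forest:
        stack = [(root, 0)]  # (node, depth); the root has depth 0
        while stack:
            node, depth = stack.pop()
            if node is None or node.feature is None:
                continue  # leaves carry no split variable
            counts[node.feature][depth] += 1
            stack.append((node.left, depth + 1))
            stack.append((node.right, depth + 1))
    return dict(counts)


def mean_depth(counter: Counter) -> float:
    """Average depth at which a variable is used for splitting."""
    total = sum(counter.values())
    return sum(d * c for d, c in counter.items()) / total


# Toy forest of two hand-built trees over variables 0, 1 and 2.
tree_a = Node(0, Node(1, Node(), Node()), Node(2, Node(), Node()))
tree_b = Node(0, Node(), Node(1, Node(2, Node(), Node()), Node()))
forest = [tree_a, tree_b]

counts = depth_counts(forest)
# Assumed ranking rule: variables that split closer to the root come first.
ranking = sorted(counts, key=lambda v: mean_depth(counts[v]))
```

In this toy forest, variable 0 always splits at the root, so it ranks first under the assumed rule. For a forest fitted with a real library, the same traversal would run over that library's tree structure instead of the hand-built `Node` objects.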
Pages: 11