A forest-based algorithm for selecting informative variables using Variable Depth Distribution

被引:4
|
作者
Voronov, Sergii [1 ]
Jung, Voronov Daniel [1 ]
Frisk, Erik [1 ]
机构
[1] Linkoping Univ, Dept Elect Engn, S-58183 Linkoping, Sweden
关键词
Variable selection; Random Survival Forest; Random Forest; Automotive; MISFIRE DETECTION; SURVIVAL;
D O I
10.1016/j.engappai.2020.104073
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Predictive maintenance of systems and their components in technical systems is a promising approach to optimize system usage and reduce system downtime. Various sensor data are logged during system operation for different purposes, but sometimes not directly related to the degradation of a specific component. Variable selection algorithms are necessary to reduce model complexity and improve interpretability of diagnostic and prognostic algorithms. This paper presents a forest-based variable selection algorithm that analyzes the distribution of a variable in the decision tree structure, called Variable Depth Distribution, to measure its importance. The proposed variable selection algorithm is developed for datasets with correlated variables that pose problems for existing forest-based variable selection methods. The proposed variable selection method is evaluated and analyzed using three case studies: survival analysis of lead-acid batteries in heavy-duty vehicles, engine misfire detection, and a simulated prognostics dataset. The results show the usefulness of the proposed algorithm, with respect to existing forest-based methods, and its ability to identify important variables in different applications. As an example, the battery prognostics case study shows that similar predictive performance is achieved when only 17% percent of the variables are used compared to all measured signals.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] A Novel Framework for Selecting Informative Meteorological Stations Using Monte Carlo Feature Selection (MCFS) Algorithm
    Niaz, Rizwan
    Almanjahie, Ibrahim M.
    Ali, Zulfiqar
    Faisal, Muhammad
    Hussain, Ijaz
    ADVANCES IN METEOROLOGY, 2020, 2020
  • [32] itsdm: Isolation forest-based presence-only species distribution modelling and explanation in r
    Song, Lei
    Estes, Lyndon
    METHODS IN ECOLOGY AND EVOLUTION, 2023, 14 (03): : 831 - 840
  • [33] An Efficient Forest-Based Tabu Search Algorithm for the Split-Delivery Vehicle Routing Problem
    Zhang, Zizhen
    He, Huang
    Luo, Zhixing
    Qin, Hu
    Guo, Songshan
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 3432 - 3438
  • [34] A random forest-based algorithm for data-intensive spatial interpolation in crop yield mapping
    Mariano, Cordoba
    Monica, Balzarini
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 184
  • [35] Sequential analysis of latent variables using mixed-effect latent variable models:: Impact of non-informative and informative missing data
    Sebille, Veronique
    Hardouin, Jean-Benoit
    Mesbah, Mounir
    STATISTICS IN MEDICINE, 2007, 26 (27) : 4889 - 4904
  • [36] MicroHDF: predicting host phenotypes with metagenomic data using a deep forest-based framework
    Shi, Kai
    Liu, Qiaohui
    Ji, Qingrong
    He, Qisheng
    Zhao, Xing-Ming
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (06)
  • [37] Estimating Software Development Efforts Using a Random Forest-Based Stacked Ensemble Approach
    Varshini, Priya A. G.
    Kumari, Anitha K.
    Varadarajan, Vijayakumar
    ELECTRONICS, 2021, 10 (10)
  • [38] Selecting neighborhood points algorithm based on space distribution weight coefficient
    Sun, Lishuang
    Wang, Ende
    Wang, Jingli
    Ma, Yuntao
    Shenyang Jianzhu Daxue Xuebao (Ziran Kexue Ban)/Journal of Shenyang Jianzhu University (Natural Science), 2007, 23 (03): : 423 - 426
  • [39] Cost-sensitive feature selection using random forest: Selecting low-cost subsets of informative features
    Zhou, Qifeng
    Zhou, Hao
    Li, Tao
    KNOWLEDGE-BASED SYSTEMS, 2016, 95 : 1 - 11
  • [40] Learning in the Field: Using Community Self Studies to Strengthen Forest-Based Social Movements
    Taylor, Peter Leigh
    Cronkleton, Peter
    Barry, Deborah
    SUSTAINABLE DEVELOPMENT, 2013, 21 (04) : 209 - 223