Method for filling missing data of mine ventilation parameters

被引:0
|
作者
Ni J. [1 ,2 ]
Liu X. [1 ,2 ]
Deng L. [1 ,2 ]
机构
[1] College of Safety Science & Engineering, Liaoning Technical University, Huludao
[2] Key Laboratory of Mine Thermo-Motive Disaster and Prevention, Ministry of Education, Liaoning Technical University, Huludao
来源
Meitan Xuebao/Journal of the China Coal Society | 2024年 / 49卷 / 05期
关键词
data completion; mine ventilation; missing data; multiple interpolation of chain equations; random forest;
D O I
10.13225/j.cnki.jccs.2023.0481
中图分类号
学科分类号
摘要
The intelligent mine ventilation system is very important for the intelligent construction of coal mines. In order to solve the problem of missing mine ventilation parameter data caused by the lack of measurement conditions, instrument signal interference, uneven wind speed of roadway section, improper manual operation and other restrictive factors during actual measurement of mine ventilation parameters, a method for filling the missing data of mine ventilation parameters based on the multiple imputation method of random forest-chained equation was proposed. Multiple imputation with chained equations is used to generate n filled values for each missing attribute value by iterations, resulting in n complete datasets, and a final complete dataset is obtained by analyzing and optimizing the n complete datasets. In order to improve the filling accuracy of missing values, the influence of the uncertainty of missing data of mine ventilation parameters on the analysis process is reasonably considered, and the missing data is filled in the prediction task of random forest in combination with the prediction mean matching model. Taking the Luxin No.2 Mine as an experimental example, the intelligent mine ventilation simulation system IMVS was used to preprocess the original data set of ventilation parameters of the Luxin No.2 Mine to obtain a complete and accurate complete dataset of mine ventilation parameters. Comparative experiments with different missing attributes, different data missing rates, and different number of iterations were conducted separately for the complete data set. The effectiveness of the model was evaluated by a variety of model evaluation indicators. The results show that the complete data set formed by the multiple imputation method of random forest-chained equation has good similarity with the original data set. Results of filling experiments with different missing columns show that the filling model can easily handle mixed data types, autonomously learning the correlations between parameters and thus reducing filling complexity. The n datasets formed after iterations are combined into a final dataset by analysis, which improves the filling accuracy. Experiments with different iterations on the complete data set after initial filling show that the data correlation will converge after a certain number of iterations. © 2024 China Coal Society. All rights reserved.
引用
收藏
页码:2315 / 2323
页数:8
相关论文
共 22 条
  • [11] CHEN Juan, WANG Xianyu, LUO Lingling, Et al., Comparison of machine learning and statistical learning in the imputation of missing values[J], Statistics & Decision, 36, 17, (2020)
  • [12] LIU Lushi, Quantitative prediction of seafloor sulfide mineralization based on machine learning and missing value interpolation technique, pp. 37-43, (2022)
  • [13] AWAN S E, BENNAMOUN M, SOHEL F, Et al., Imputation of missing data with class imbalance using conditional generative adversarial networks[J], Neurocomputing, 453, (2021)
  • [14] LIU Zegong, Calculation of branch wind resistance of complex ventilation network by air regulation and resistance measurement of ventilation system, Safety in Coal Mines, 22, 1, (1991)
  • [15] SI Junhong, CHEN Kaiyan, Measuring airflow & evaluating resistance model of the mine ventilation network based on Tikhonov regularization[J], Journal of China Coal Society, 37, 6, (2012)
  • [16] DENG Lijun, Research on inversion of mine ventilation resistance coefficient, Fuxin: Liaoning University of Engineering and Technology, pp. 41-55, (2014)
  • [17] LIU Jian, LI Xuebing, CHEN Tingkai, Et al., Theoretical analysis on influence of steady turbulence fluctuation on ventilation resistance measurement in mine[J], Journal of Safety Science and Technology, 12, 5, (2016)
  • [18] LI Yucheng, LI Junqiao, DENG Cunbao, Et al., Improved algorithm of air quantity calculating resistance based on diagonal subnetwork [J], Journal of China Coal Society, 44, 4, (2019)
  • [19] NI Jingfeng, Research on the visualization of mine ventilation simulation system, Fuxin: Liaoning University of Engineering and Technology, pp. 93-105, (2004)
  • [20] GRUND S, LUDTKE O, ROBITZSCH A., Multiple imputation of missing data in multilevel models with the R package mdmb: A flexible sequential modeling approach[J], Behavior Research Methods, 53, 6, (2021)