Method for filling missing data of mine ventilation parameters

Authors
Ni J. [1, 2]
Liu X. [1, 2]
Deng L. [1, 2]
Affiliations
[1] College of Safety Science & Engineering, Liaoning Technical University, Huludao
[2] Key Laboratory of Mine Thermo-Motive Disaster and Prevention, Ministry of Education, Liaoning Technical University, Huludao
Keywords
data completion; mine ventilation; missing data; multiple imputation by chained equations; random forest
DOI
10.13225/j.cnki.jccs.2023.0481
Abstract
An intelligent mine ventilation system is essential to the intelligent construction of coal mines. In practice, mine ventilation parameter data are often missing because of inadequate measurement conditions, instrument signal interference, uneven wind speed across the roadway section, improper manual operation, and other constraints on field measurement. To address this problem, a method for filling missing mine ventilation parameter data is proposed, based on multiple imputation by chained equations with random forest. Multiple imputation by chained equations iteratively generates n filled values for each missing attribute value, yielding n complete datasets; the final complete dataset is obtained by analyzing and optimizing these n datasets. To improve filling accuracy, the method takes into account the influence of the uncertainty of the missing data on the analysis, and fills the missing data through random forest prediction combined with the predictive mean matching model. Taking the Luxin No.2 Mine as an experimental example, the intelligent mine ventilation simulation system IMVS was used to preprocess the mine's original ventilation parameter dataset into a complete and accurate dataset. Comparative experiments were then run on this complete dataset with different missing attributes, different missing rates, and different numbers of iterations, and the model was assessed with several evaluation indicators. The results show that the complete dataset produced by the random forest-chained equation multiple imputation method is in good agreement with the original dataset. Filling experiments with different missing columns show that the model handles mixed data types readily and learns the correlations between parameters autonomously, which reduces filling complexity. Combining the n datasets formed after iteration into a final dataset by analysis improves filling accuracy. Experiments with different numbers of iterations on the initially filled complete dataset show that the data correlation converges after a certain number of iterations. © 2024 China Coal Society. All rights reserved.
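The workflow summarized in the abstract (chained-equation iteration, predictive mean matching, n completed datasets, pooling) can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation. Several parts are assumptions made for illustration: a k-nearest-neighbour mean stands in for the random forest predictor, the n datasets are pooled by cell-wise averaging (the abstract does not say how they are combined), each column is assumed to have at least one observed value, and the toy table (two numeric columns related by h = 0.5 q^2) is synthetic rather than Luxin No.2 Mine data. All function names are hypothetical.

```python
import random
import statistics


def others(row, j):
    """All values of `row` except column j (the predictors for column j)."""
    return [v for c, v in enumerate(row) if c != j]


def knn_predict(train_x, train_y, query, k=3):
    """Mean target of the k training rows closest to `query`.

    Stand-in for the random forest predictor (assumption, for brevity).
    """
    order = sorted(
        range(len(train_x)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(train_x[i], query)),
    )
    return statistics.fmean(train_y[i] for i in order[:k])


def impute_once(rows, n_iter=5, donors=3, rng=None):
    """One chained-equation run; returns a completed copy (None = missing)."""
    rng = rng or random.Random(0)
    n_cols = len(rows[0])
    missing = [[v is None for v in row] for row in rows]
    # Initial fill: mean of the observed values in each column.
    means = [
        statistics.fmean(r[j] for r in rows if r[j] is not None)
        for j in range(n_cols)
    ]
    data = [[means[j] if v is None else v for j, v in enumerate(r)] for r in rows]
    for _ in range(n_iter):
        for j in range(n_cols):
            obs = [i for i in range(len(rows)) if not missing[i][j]]
            mis = [i for i in range(len(rows)) if missing[i][j]]
            if not mis:
                continue
            train_x = [others(data[i], j) for i in obs]
            train_y = [data[i][j] for i in obs]
            pred_obs = [knn_predict(train_x, train_y, x) for x in train_x]
            for i in mis:
                pred = knn_predict(train_x, train_y, others(data[i], j))
                # Predictive mean matching: take the observed rows whose
                # predictions are closest to this one and copy a real
                # observed value from a randomly chosen donor.
                nearest = sorted(
                    range(len(obs)), key=lambda d: abs(pred_obs[d] - pred)
                )[:donors]
                data[i][j] = train_y[rng.choice(nearest)]
    return data


def multiple_impute(rows, n_sets=5, seed=0):
    """Build n_sets completed datasets and pool them cell-wise."""
    completed = [
        impute_once(rows, rng=random.Random(seed + s)) for s in range(n_sets)
    ]
    # Pooling by averaging is an assumption; observed cells stay untouched.
    pooled = [
        [
            v if v is not None else statistics.fmean(ds[i][j] for ds in completed)
            for j, v in enumerate(row)
        ]
        for i, row in enumerate(rows)
    ]
    return completed, pooled


# Synthetic toy table: column 0 = q, column 1 = h = 0.5 * q**2.
toy = [
    [10.0, 50.0], [12.0, 72.0], [14.0, 98.0], [16.0, None],
    [18.0, 162.0], [None, 200.0], [22.0, 242.0], [24.0, 288.0],
]
completed, pooled = multiple_impute(toy, n_sets=5)
print(pooled[3][1], pooled[5][0])
```

Because predictive mean matching copies values from observed donors, every imputed cell in each of the n datasets is a value that actually occurs in that column; only the pooled result can fall between observed values.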
Pages: 2315-2323
Page count: 8