Multi-label classification;
Multi-output modeling;
Missing value imputation;
Probabilistic inference;
D O I:
10.1007/s10994-024-06584-1
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
Missing values are a common problem in data science and machine learning. Removing instances with missing values is a straightforward workaround, but this can significantly hinder subsequent data analysis, particularly when features outnumber instances. There are a variety of methodologies proposed in the literature for imputing missing values. Denoising Autoencoders, for example, have been leveraged efficiently for imputation. However, neural network approaches have been relatively less effective on smaller datasets. In this work, we propose Autoreplicative Random Forests (ARF) as a multi-output learning approach, which we introduce in the context of a framework that may impute via either an iterative or procedural process. Experiments on several low- and high-dimensional datasets show that ARF is computationally efficient and exhibits better imputation performance than its competitors, including neural network approaches. In order to provide statistical analysis and mathematical background to the proposed missing value imputation framework, we also propose probabilistic ARFs, where the confidence values are provided over different imputation hypotheses, therefore maximizing the utility of such a framework in a machine-learning pipeline targeting predictive performance.
机构:
Wuhan Univ Technol, Sch Naval Architecture Ocean & Energy Power Engn, Wuhan 430063, Peoples R ChinaWuhan Univ Technol, Sch Naval Architecture Ocean & Energy Power Engn, Wuhan 430063, Peoples R China
Ou, Hongsen
Yao, Yunan
论文数: 0引用数: 0
h-index: 0
机构:
Wuhan Univ Technol, Sch Naval Architecture Ocean & Energy Power Engn, Wuhan 430063, Peoples R ChinaWuhan Univ Technol, Sch Naval Architecture Ocean & Energy Power Engn, Wuhan 430063, Peoples R China
Yao, Yunan
He, Yi
论文数: 0引用数: 0
h-index: 0
机构:
Wuhan Univ Technol, Sch Naval Architecture Ocean & Energy Power Engn, Wuhan 430063, Peoples R ChinaWuhan Univ Technol, Sch Naval Architecture Ocean & Energy Power Engn, Wuhan 430063, Peoples R China