Flexible Factor Model for Handling Missing Data in Supervised Learning

被引:0
|
作者
Bekker, Andriette [1 ]
Hashemi, Farzane [1 ,2 ]
Arashi, Mohammad [1 ,3 ]
机构
[1] Univ Pretoria, Dept Stat, Fac Nat & Agr Sci, Pretoria, South Africa
[2] Univ Kashan, Fac Math Sci, Dept Stat, Kashan, Iran
[3] Ferdowsi Univ Mashhad, Dept Stat, Fac Math Sci, Mashhad, Razavi Khorasan, Iran
基金
新加坡国家研究基金会;
关键词
Automobile dataset; Asymmetry; ECME algorithm; Factor analysis model; Heavy tails; Incomplete data; Liver disorders dataset; FACTOR ANALYZERS; MIXTURES; INFERENCE; EXTENSION; ALGORITHM; ECM; EM;
D O I
10.1007/s40304-021-00260-9
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
This paper presents an extension of the factor analysis model based on the normal mean-variance mixture of the Birnbaum-Saunders in the presence of nonresponses and missing data. This model can be used as a powerful tool to model non-normal features observed from data such as strongly skewed and heavy-tailed noises. Missing data may occur due to operator error or incomplete data capturing therefore cannot be ignored in factor analysis modeling. We implement an EM-type algorithm for maximum likelihood estimation and propose single imputation of possible missing values under a missing at random mechanism. The potential and applicability of our proposed method are illustrated through analyzing both simulated and real datasets.
引用
收藏
页码:477 / 501
页数:25
相关论文
共 50 条
  • [1] Flexible Factor Model for Handling Missing Data in Supervised Learning
    Andriette Bekker
    Farzane Hashemi
    Mohammad Arashi
    [J]. Communications in Mathematics and Statistics, 2023, 11 : 477 - 501
  • [2] Missing Data Imputation for Supervised Learning
    Poulos, Jason
    Valle, Rafael
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2018, 32 (02) : 186 - 196
  • [3] Active Learning for Handling Missing Data
    Tharwat, Alaa
    Schenck, Wolfram
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15
  • [4] Multiple Imputation A Flexible Tool for Handling Missing Data
    Li, Peng
    Stuart, Elizabeth A.
    Allison, David B.
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2015, 314 (18): : 1966 - 1967
  • [5] Handling incomplete categorical data for supervised learning
    Chien, Been-Chian
    Lu, Cheng-Feng
    Hsu, Steen J.
    [J]. ADVANCES IN APPLIED ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4031 : 1318 - 1328
  • [6] Handling Missing Data with Graph Representation Learning
    You, Jiaxuan
    Ma, Xiaobai
    Ding, Daisy Yi
    Kochenderfer, Mykel
    Leskovec, Jure
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [7] Adaptive Data Augmentation for Supervised Learning over Missing Data
    Liu, Tongyu
    Fan, Ju
    Luo, Yinqing
    Tang, Nan
    Li, Guoliang
    Du, Xiaoyong
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (07): : 1202 - 1214
  • [8] Distributed Semi-Supervised Learning With Missing Data
    Xu, Zhen
    Liu, Ying
    Li, Chunguang
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (12) : 6165 - 6178
  • [9] A supervised machine learning model for imputing missing boarding stops in smart card data
    Shalit, Nadav
    Fire, Michael
    Ben-Elia, Eran
    [J]. PUBLIC TRANSPORT, 2023, 15 (02) : 287 - 319
  • [10] A supervised machine learning model for imputing missing boarding stops in smart card data
    Nadav Shalit
    Michael Fire
    Eran Ben-Elia
    [J]. Public Transport, 2023, 15 : 287 - 319