Deep Learning Approach for Imputation of Missing Values in Actigraphy Data: Algorithm Development Study

被引:16
|
作者
Jang, Jong-Hwan [1 ]
Choi, Junggu [1 ]
Roh, Hyun Woong [2 ]
Son, Sang Joon [3 ]
Hong, Chang Hyung [3 ]
Kim, Eun Young [1 ,4 ]
Kim, Tae Young [1 ]
Yoon, Dukyong [1 ,4 ]
机构
[1] Ajou Univ, Sch Med, Dept Biomed Informat, World Cup Ro 206, Suwon 16499, Gyeonggi Do, South Korea
[2] Ajou Univ, Sch Med, Dept Brain Sci, Suwon, Gyeonggi Do, South Korea
[3] Ajou Univ, Sch Med, Dept Psychiat, Suwon, Gyeonggi Do, South Korea
[4] Ajou Univ, Grad Sch Med, Dept Biomed Sci, Suwon, Gyeonggi Do, South Korea
来源
JMIR MHEALTH AND UHEALTH | 2020年 / 8卷 / 07期
关键词
accelerometer; actigraphy; imputation; autoencoder; deep learning; MEASURED PHYSICAL-ACTIVITY;
D O I
10.2196/16113
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Data collected by an actigraphy device worn on the wrist or waist can provide objective measurements for studies related to physical activity; however, some data may contain intervals where values are missing. In previous studies, statistical methods have been applied to impute missing values on the basis of statistical assumptions. Deep learning algorithms, however, can learn features from the data without any such assumptions and may outperform previous approaches in imputation tasks. Objective: The aim of this study was to impute missing values in data using a deep learning approach. Methods: To develop an imputation model for missing values in accelerometer-based actigraphy data, a denoising convolutional autoencoder was adopted. We trained and tested our deep learning-based imputation model with the National Health and Nutrition Examination Survey data set and validated it with the external Korea National Health and Nutrition Examination Survey and the Korean Chronic Cerebrovascular Disease Oriented Biobank data sets which consist of daily records measuring activity counts. The partial root mean square error and partial mean absolute error of the imputed intervals (partial RMSE and partial MAE, respectively) were calculated using our deep learning-based imputation model (zero-inflated denoising convolutional autoencoder) as well as using other approaches (mean imputation, zero-inflated Poisson regression, and Bayesian regression). Results: The zero-inflated denoising convolutional autoencoder exhibited a partial RMSE of 839.3 counts and partial MAE of 431.1 counts, whereas mean imputation achieved a partial RMSE of 1053.2 counts and partial MAE of 545.4 counts, the zero-inflated Poisson regression model achieved a partial RMSE of 1255.6 counts and partial MAE of 508.6 counts, and Bayesian regression achieved a partial RMSE of 924.5 counts and partial MAE of 605.8 counts. Conclusions: Our deep learning-based imputation model performed better than the other methods when imputing missing values in actigraphy data.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] A Novel Index Measure Imputation Algorithm for Missing Data Values: A Machine Learning Approach
    Madhu, G.
    Rajinikanth, T. V.
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2012, : 81 - 87
  • [2] A First Approach on Big Data Missing Values Imputation
    Montesdeoca, Besay
    Luengo, Julian
    Maillo, Jesus
    Garcia-Gil, Diego
    Garcia, Salvador
    Herrera, Francisco
    [J]. PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, BIG DATA AND SECURITY (IOTBDS 2019), 2019, : 315 - 323
  • [3] DL-GSA: A Deep Learning Metaheuristic Approach to Missing Data Imputation
    Garg, Ayush
    Naryani, Deepika
    Aggarwal, Garvit
    Aggarwal, Swati
    [J]. ADVANCES IN SWARM INTELLIGENCE, ICSI 2018, PT II, 2018, 10942 : 513 - 521
  • [4] Water-Quality Data Imputation with a High Percentage of Missing Values: A Machine Learning Approach
    Rodriguez, Rafael
    Pastorini, Marcos
    Etcheverry, Lorena
    Chreties, Christian
    Fossati, Monica
    Castro, Alberto
    Gorgoglione, Angela
    [J]. SUSTAINABILITY, 2021, 13 (11)
  • [5] Missing Values Imputation Using Genetic Algorithm for the Analysis of Traffic Data
    Midde, Ranjit Reddy
    Srinivasa, K. G.
    Reddy, Eswara B.
    [J]. ARTIFICIAL INTELLIGENCE AND EVOLUTIONARY COMPUTATIONS IN ENGINEERING SYSTEMS, ICAIECES 2017, 2018, 668 : 251 - 261
  • [6] Missing Data Imputation using Machine Learning Algorithm for Supervised Learning
    Cenitta, D.
    Arjunan, R. Vijaya
    Prema, K., V
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2021,
  • [7] Comparative analysis of traditional methods and a deep learning approach for multivariate imputation of missing values in the meteorological field
    Arias-Munoz, Ana Cristina
    Cob-Garcia, Susana
    Calvo-Valverde, Luis-Alexander
    [J]. TECNOLOGIA EN MARCHA, 2024, 37 (03): : 33 - 47
  • [8] Handling missing values in healthcare data: A systematic review of deep learning-based imputation techniques
    Liu, Mingxuan
    Li, Siqi
    Yuan, Han
    Ong, Marcus Eng Hock
    Ning, Yilin
    Xie, Feng
    Saffari, Seyed Ehsan
    Shang, Yuqing
    Volovici, Victor
    Chakraborty, Bibhas
    Liu, Nan
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2023, 142
  • [9] Data Imputation for Symbolic Regression with Missing Values: A Comparative Study
    Al-Helali, Baligh
    Chen, Qi
    Xue, Bing
    Zhang, Mengjie
    [J]. 2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 2093 - 2100
  • [10] Missing values imputation for a clustering genetic algorithm
    Hruschka, ER
    Hruschka, ER
    Ebecken, NFF
    [J]. ADVANCES IN NATURAL COMPUTATION, PT 3, PROCEEDINGS, 2005, 3612 : 245 - 254