Practical approaches in evaluating validation and biases of machine learning applied to mobile health studies

被引:2
|
作者
Allgaier, Johannes [1 ]
Pryss, Ruediger [1 ]
机构
[1] Julius Maximilians Univ Wurzburg, Inst Clin Epidemiol & Biometry, Josef Schneider Str 2, Wurzburg, Germany
来源
COMMUNICATIONS MEDICINE | 2024年 / 4卷 / 01期
关键词
MODEL SELECTION;
D O I
10.1038/s43856-024-00468-0
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Background Machine learning (ML) models are evaluated in a test set to estimate model performance after deployment. The design of the test set is therefore of importance because if the data distribution after deployment differs too much, the model performance decreases. At the same time, the data often contains undetected groups. For example, multiple assessments from one user may constitute a group, which is usually the case in mHealth scenarios.Methods In this work, we evaluate a model's performance using several cross-validation train-test-split approaches, in some cases deliberately ignoring the groups. By sorting the groups (in our case: Users) by time, we additionally simulate a concept drift scenario for better external validity. For this evaluation, we use 7 longitudinal mHealth datasets, all containing Ecological Momentary Assessments (EMA). Further, we compared the model performance with baseline heuristics, questioning the essential utility of a complex ML model.Results Hidden groups in the dataset leads to overestimation of ML performance after deployment. For prediction, a user's last completed questionnaire is a reasonable heuristic for the next response, and potentially outperforms a complex ML model. Because we included 7 studies, low variance appears to be a more fundamental phenomenon of mHealth datasets.Conclusions The way mHealth-based data are generated by EMA leads to questions of user and assessment level and appropriate validation of ML models. Our analysis shows that further research needs to follow to obtain robust ML models. In addition, simple heuristics can be considered as an alternative for ML. Domain experts should be consulted to find potentially hidden groups in the data. Computational approaches can be used to analyse health-related data collected using mobile applications from thousands of participants. We tested the impact of some participants being represented multiple times or some not being counted properly within the analysis. In this context, we label a multi-represented participant a group. We find that ignoring such groups can lead to false estimation of health-related predictions. In some cases, simpler quantitative methods can outperform complex computational models. This highlights the importance of monitoring and validating results conducted by complex computational models and confers the use of simpler analytical methods in its place. Allgaier et al. scrutinize the performance of machine learning (ML) models applied to 7 longitudinal mHealth datasets. Their study advocates for considering simple heuristics over complex ML models, underscoring the need for robust validation and expert consultation to address hidden groups in data.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Evaluating Privacy Policy for Mobile Health APPs with Machine Learning
    Yang Z.
    Zhouzhou Y.
    Qiqi S.
    Zhonghang L.
    Data Analysis and Knowledge Discovery, 2022, 6 (05): : 112 - 126
  • [2] Evaluating and implementing machine learning models for personalised mobile health app recommendations
    Morenigbade, Hafsat
    Al Jaber, Tareq
    Gordon, Neil
    Eke, Gregory
    PLOS ONE, 2025, 20 (03):
  • [3] Error assessment and optimal cross-validation approaches in machine learning applied to impurity diffusion
    Lu, Hai-Jin
    Zou, Nan
    Jacobs, Ryan
    Afflerbach, Ben
    Lu, Xiao-Gang
    Morgans, Dane
    COMPUTATIONAL MATERIALS SCIENCE, 2019, 169
  • [4] Machine Learning Approaches in Smart Health
    Rayan, Zeina
    Alfonse, Marco
    Salem, Abdel-Badeeh M.
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE OF INFORMATION AND COMMUNICATION TECHNOLOGY [ICICT-2019], 2019, 154 : 361 - 368
  • [5] Machine learning approaches applied in spinal pain research
    Falla, Deborah
    Devecchi, Valter
    Jimenez-Grande, David
    Rugamer, David
    Liew, Bernard X. W.
    JOURNAL OF ELECTROMYOGRAPHY AND KINESIOLOGY, 2021, 61
  • [6] A Survey of Machine Learning Approaches for Mobile Robot Control
    Rybczak, Monika
    Popowniak, Natalia
    Lazarowska, Agnieszka
    ROBOTICS, 2024, 13 (01)
  • [7] A Clinical Framework for Evaluating Machine Learning Studies
    Ghazi, Lama
    Ahmad, Tariq
    Wilson, Francis Perry
    JACC-HEART FAILURE, 2022, 10 (09) : 648 - 650
  • [8] Explainable Online Validation of Machine Learning Models for Practical Applications
    Fuhl, Wolfgang
    Rong, Yao
    Motz, Thomas
    Scheidt, Michael
    Hartel, Andreas
    Koch, Andreas
    Kasneci, Enkelejda
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3304 - 3311
  • [9] Machine Learning Approaches applied to Brazilian Portuguese Word Prediction
    Cavalieri, Daniel Cruz
    Bastos Filho, Teodiano Freire
    Palazuelos Cagigas, Sira Elena
    Guarasa, Javier Macias
    Martin Sanchez, Jose L.
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (45): : 87 - 94
  • [10] Evaluating hierarchical machine learning approaches to classify biological databases
    Rezende, Pamela M.
    Xavier, Joicymara S.
    Ascher, David B.
    Fernandes, Gabriel R.
    Pires, Douglas E., V
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (04)