A novel machine learning-based prediction method for patients at risk of developing depressive symptoms using a small data

被引:0
|
作者
Yun, Minyoung [1 ,2 ]
Jeon, Minjeong [3 ]
Yang, Heyoung [4 ]
机构
[1] Korea Inst Sci & Technol Informat, Ctr R&D Investment & Strategy Res, Seoul, South Korea
[2] Ecole Natl Super Arts & Metiers, Paris, France
[3] Univ Calif Los Angeles, Sch Educ & Informat Studies, Los Angeles, CA USA
[4] Korea Inst Sci & Technol Informat, Ctr Future Technol Anal, Seoul, South Korea
来源
PLOS ONE | 2024年 / 19卷 / 05期
关键词
D O I
10.1371/journal.pone.0303889
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The prediction of depression is a crucial area of research which makes it one of the top priorities in mental health research as it enables early intervention and can lead to higher success rates in treatment. Self-reported feelings by patients represent a valuable biomarker for predicting depression as they can be expressed in a lower-dimensional network form, offering an advantage in visualizing the interactive characteristics of depression-related feelings. Furthermore, the network form of data expresses high-dimensional data in a compact form, making the data easy to use as input for the machine learning processes. In this study, we applied the graph convolutional network (GCN) algorithm, an effective machine learning tool for handling network data, to predict depression-prone patients using the network form of self-reported log data as the input. We took a data augmentation step to expand the initially small dataset and fed the resulting data into the GCN algorithm, which achieved a high level of accuracy from 86-97% and an F1 (harmonic mean of precision and recall) score of 0.83-0.94 through three experimental cases. In these cases, the ratio of depressive cases varied, and high accuracy and F1 scores were observed in all three cases. This study not only demonstrates the potential for predicting depression-prone patients using self-reported logs as a biomarker in advance, but also shows promise in handling small data sets in the prediction, which is critical given the challenge of obtaining large datasets for biomarker research. The combination of self-reported logs and the GCN algorithm is a promising approach for predicting depression and warrants further investigation.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Machine Learning-based Crop Yield Prediction by Data Augmentation
    Balmumcu, Alper
    Kayabol, Koray
    Erten, Esra
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
  • [32] Identification of Risk Factors and Machine Learning-Based Prediction Models for Knee Osteoarthritis Patients
    Kokkotis, Christos
    Moustakidis, Serafeim
    Giakas, Giannis
    Tsaopoulos, Dimitrios
    APPLIED SCIENCES-BASEL, 2020, 10 (19):
  • [33] Machine learning-based risk prediction of malignant arrhythmia in hospitalized patients with heart failure
    Wang, Qi
    Li, Bin
    Chen, Kangyu
    Yu, Fei
    Su, Hao
    Hu, Kai
    Liu, Zhiquan
    Wu, Guohong
    Yan, Ji
    Su, Guohai
    ESC HEART FAILURE, 2022, 8 (06): : 5363 - 5371
  • [34] Machine learning-based risk prediction of intrahospital clinical outcomes in patients undergoing TAVI
    Gomes, Bruna
    Pilz, Maximilian
    Reich, Christoph
    Leuschner, Florian
    Konstandin, Mathias
    Katus, Hugo A.
    Meder, Benjamin
    CLINICAL RESEARCH IN CARDIOLOGY, 2021, 110 (03) : 343 - 356
  • [35] Machine learning-based risk prediction of malignant arrhythmia in hospitalized patients with heart failure
    Wang, Qi
    Li, Bin
    Chen, Kangyu
    Yu, Fei
    Su, Hao
    Hu, Kai
    Liu, Zhiquan
    Wu, Guohong
    Yan, Ji
    Su, Guohai
    ESC HEART FAILURE, 2021, 8 (06): : 5363 - 5371
  • [36] Machine learning-based risk prediction of intrahospital clinical outcomes in patients undergoing TAVI
    Bruna Gomes
    Maximilian Pilz
    Christoph Reich
    Florian Leuschner
    Mathias Konstandin
    Hugo A. Katus
    Benjamin Meder
    Clinical Research in Cardiology, 2021, 110 : 343 - 356
  • [37] Machine learning-based risk prediction model for cardiovascular disease using a hybrid dataset
    Kanagarathinam, Karthick
    Sankaran, Durairaj
    Manikandan, R.
    DATA & KNOWLEDGE ENGINEERING, 2022, 140
  • [38] Developing an explainable rockburst risk prediction method using monitored microseismicity based on interpretable machine learning approach
    Basnet, Prabhat Man Singh
    Jin, Aibing
    Mahtab, Shakil
    ACTA GEOPHYSICA, 2024, 72 (04) : 2597 - 2618
  • [39] A machine learning-based approach for the prediction of periprocedural myocardial infarction by using routine data
    Wang, Yao
    Zhu, Kangjun
    Li, Ya
    Lv, Qingbo
    Fu, Guosheng
    Zhang, Wenbin
    CARDIOVASCULAR DIAGNOSIS AND THERAPY, 2020, 10 (05) : 1313 - 1324
  • [40] MACHINE LEARNING-BASED PREDICTION OF ICU COMPLICATIONS USING MEDICATION DATA: A VALIDATION STUDY
    Smith, Susan
    Zhao, Bokai
    Deng, Shiyuan
    Hu, Mengxuan
    Zhang, Tianyi
    Kong, Yanlei
    Shen, Ye
    Li, Sheng
    Murphy, David
    Murray, Brian
    Kamaleswaran, Rishikesan
    Chen, Xianyan
    Devlin, John
    Sikora, Andrea
    CRITICAL CARE MEDICINE, 2025, 53 (01)