A novel machine learning-based prediction method for patients at risk of developing depressive symptoms using a small data

被引:0
|
作者
Yun, Minyoung [1 ,2 ]
Jeon, Minjeong [3 ]
Yang, Heyoung [4 ]
机构
[1] Korea Inst Sci & Technol Informat, Ctr R&D Investment & Strategy Res, Seoul, South Korea
[2] Ecole Natl Super Arts & Metiers, Paris, France
[3] Univ Calif Los Angeles, Sch Educ & Informat Studies, Los Angeles, CA USA
[4] Korea Inst Sci & Technol Informat, Ctr Future Technol Anal, Seoul, South Korea
来源
PLOS ONE | 2024年 / 19卷 / 05期
关键词
D O I
10.1371/journal.pone.0303889
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The prediction of depression is a crucial area of research which makes it one of the top priorities in mental health research as it enables early intervention and can lead to higher success rates in treatment. Self-reported feelings by patients represent a valuable biomarker for predicting depression as they can be expressed in a lower-dimensional network form, offering an advantage in visualizing the interactive characteristics of depression-related feelings. Furthermore, the network form of data expresses high-dimensional data in a compact form, making the data easy to use as input for the machine learning processes. In this study, we applied the graph convolutional network (GCN) algorithm, an effective machine learning tool for handling network data, to predict depression-prone patients using the network form of self-reported log data as the input. We took a data augmentation step to expand the initially small dataset and fed the resulting data into the GCN algorithm, which achieved a high level of accuracy from 86-97% and an F1 (harmonic mean of precision and recall) score of 0.83-0.94 through three experimental cases. In these cases, the ratio of depressive cases varied, and high accuracy and F1 scores were observed in all three cases. This study not only demonstrates the potential for predicting depression-prone patients using self-reported logs as a biomarker in advance, but also shows promise in handling small data sets in the prediction, which is critical given the challenge of obtaining large datasets for biomarker research. The combination of self-reported logs and the GCN algorithm is a promising approach for predicting depression and warrants further investigation.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Machine learning-based prediction of diabetic patients using blood routine data
    Li, Honghao
    Su, Dongqing
    Zhang, Xinpeng
    He, Yuanyuan
    Luo, Xu
    Xiong, Yuqiang
    Zou, Min
    Wei, Huiyan
    Wen, Shaoran
    Xi, Qilemuge
    Zuo, Yongchun
    Yang, Lei
    METHODS, 2024, 229 : 156 - 162
  • [2] Machine learning-based approaches for cancer prediction using microbiome data
    Freitas, Pedro
    Silva, Francisco
    Sousa, Joana Vale
    Ferreira, Rui M.
    Figueiredo, Ceu
    Pereira, Tania
    Oliveira, Helder P.
    SCIENTIFIC REPORTS, 2023, 13 (01):
  • [3] Machine learning-based approaches for cancer prediction using microbiome data
    Pedro Freitas
    Francisco Silva
    Joana Vale Sousa
    Rui M. Ferreira
    Céu Figueiredo
    Tania Pereira
    Hélder P. Oliveira
    Scientific Reports, 13 (1)
  • [4] Improved metabolomic data-based prediction of depressive symptoms using nonlinear machine learning with feature selection
    Yuta Takahashi
    Masao Ueki
    Makoto Yamada
    Gen Tamiya
    Ikuko N. Motoike
    Daisuke Saigusa
    Miyuki Sakurai
    Fuji Nagami
    Soichi Ogishima
    Seizo Koshiba
    Kengo Kinoshita
    Masayuki Yamamoto
    Hiroaki Tomita
    Translational Psychiatry, 10
  • [5] Improved metabolomic data-based prediction of depressive symptoms using nonlinear machine learning with feature selection
    Takahashi, Yuta
    Ueki, Masao
    Yamada, Makoto
    Tamiya, Gen
    Motoike, Ikuko N.
    Saigusa, Daisuke
    Sakurai, Miyuki
    Nagami, Fuji
    Ogishima, Soichi
    Koshiba, Seizo
    Kinoshita, Kengo
    Yamamoto, Masayuki
    Tomita, Hiroaki
    TRANSLATIONAL PSYCHIATRY, 2020, 10 (01)
  • [6] A machine learning-based prediction model for QOL using lifelogs which are related symptoms suffering the patients
    Higashiyama, Nozomi
    Yamaguchi, Ken
    Inayama, Yoshihide
    Ueda, Akihiko
    Taki, Mana
    Kitamura, Sachiko
    Murakami, Ryusuke
    Yamanoi, Koji
    Hamanishi, Junzo
    Mandai, Masaki
    CANCER SCIENCE, 2023, 114 : 1265 - 1265
  • [7] Developing a machine learning-based flood risk prediction model for the Indus Basin in Pakistan
    Khan, Mehran
    Khan, Afed Ullah
    Ullah, Basir
    Khan, Sunaid
    WATER PRACTICE AND TECHNOLOGY, 2024, 19 (06) : 2213 - 2225
  • [8] Machine Learning-Based Prediction of Readmission Risk in Cardiovascular and Cerebrovascular Conditions Using Patient EMR Data
    Panchangam, Prasad V. R.
    Tejas, A.
    Thejas, B. U.
    Maniaci, Michael J.
    HEALTHCARE, 2024, 12 (15)
  • [9] Machine Learning-Based Mortality Prediction of Patients at Risk During Hospital Admission
    Trentino, Kevin M.
    Schwarzbauer, Karin
    Mitterecker, Andreas
    Hofmann, Axel
    Lloyd, Adam
    Leahy, Michael F.
    Tschoellitsch, Thomas
    Bock, Carl
    Hochreiter, Sepp
    Meier, Jens
    JOURNAL OF PATIENT SAFETY, 2022, 18 (05) : 494 - 498
  • [10] Machine Learning-Based Asthma Risk Prediction Using IoT and Smartphone Applications
    Bhat, Gautam S.
    Shankar, Nikhil
    Kim, Dohyeong
    Song, Dae Jin
    Seo, Sungchul
    Panahi, Issa M. S.
    Tamil, Lakshman
    IEEE ACCESS, 2021, 9 : 118708 - 118715