Diagnostic accuracy of deep learning using speech samples in depression: a systematic review and meta-analysis

被引:0
|
作者
Liu, Lidan [1 ]
Liu, Lu [1 ]
Wafa, Hatem A. [1 ]
Tydeman, Florence [1 ]
Xie, Wanqing [2 ,3 ,4 ]
Wang, Yanzhong [1 ]
机构
[1] Kings Coll London, Fac Life Sci & Med, Sch Life Course & Populat Sci, Dept Populat Hlth Sci, 4th Floor,Addison House,Guys Campus, London SE1 1UL, England
[2] Anhui Med Univ, Sch Biomed Engn, Dept Intelligent Med Engn, Hefei 230032, Peoples R China
[3] Anhui Med Univ, Sch Mental Hlth & Psychol Sci, Dept Psychol, Hefei 230032, Peoples R China
[4] Harvard Univ, Harvard Med Sch, Beth Israel Deaconess Med Ctr, Boston, MA 02115 USA
关键词
depression; deep learning; speech; meta-analysis; systematic review; BURDEN;
D O I
10.1093/jamia/ocae189
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective This study aims to conduct a systematic review and meta-analysis of the diagnostic accuracy of deep learning (DL) using speech samples in depression.Materials and Methods This review included studies reporting diagnostic results of DL algorithms in depression using speech data, published from inception to January 31, 2024, on PubMed, Medline, Embase, PsycINFO, Scopus, IEEE, and Web of Science databases. Pooled accuracy, sensitivity, and specificity were obtained by random-effect models. The diagnostic Precision Study Quality Assessment Tool (QUADAS-2) was used to assess the risk of bias.Results A total of 25 studies met the inclusion criteria and 8 of them were used in the meta-analysis. The pooled estimates of accuracy, specificity, and sensitivity for depression detection models were 0.87 (95% CI, 0.81-0.93), 0.85 (95% CI, 0.78-0.91), and 0.82 (95% CI, 0.71-0.94), respectively. When stratified by model structure, the highest pooled diagnostic accuracy was 0.89 (95% CI, 0.81-0.97) in the handcrafted group.Discussion To our knowledge, our study is the first meta-analysis on the diagnostic performance of DL for depression detection from speech samples. All studies included in the meta-analysis used convolutional neural network (CNN) models, posing problems in deciphering the performance of other DL algorithms. The handcrafted model performed better than the end-to-end model in speech depression detection.Conclusions The application of DL in speech provided a useful tool for depression detection. CNN models with handcrafted acoustic features could help to improve the diagnostic performance.Protocol registration The study protocol was registered on PROSPERO (CRD42023423603).
引用
收藏
页码:2394 / 2404
页数:11
相关论文
共 50 条
  • [31] Diagnostic accuracy of eFAST in the trauma patient: a systematic review and meta-analysis
    Netherton, Stuart
    Milenkovic, Velimir
    Taylor, Mark
    Davis, Philip J.
    [J]. CANADIAN JOURNAL OF EMERGENCY MEDICINE, 2019, 21 (06) : 727 - 738
  • [32] Diagnostic accuracy of clinical tests of the hip: a systematic review with meta-analysis
    Reiman, Michael P.
    Goode, Adam P.
    Hegedus, Eric J.
    Cook, Chad E.
    Wright, Alexis A.
    [J]. BRITISH JOURNAL OF SPORTS MEDICINE, 2013, 47 (14) : 893 - 902
  • [33] Proadrenomedullin and neonatal sepsis: a systematic review and meta-analysis of diagnostic accuracy
    Milas, Gerasimos Panagiotis
    Issaris, Vasileios
    [J]. EUROPEAN JOURNAL OF PEDIATRICS, 2022, 181 (01) : 59 - 71
  • [34] Proadrenomedullin and neonatal sepsis: a systematic review and meta-analysis of diagnostic accuracy
    Gerasimos Panagiotis Milas
    Vasileios Issaris
    [J]. European Journal of Pediatrics, 2022, 181 : 59 - 71
  • [35] Diagnostic accuracy of screening tests for COPD: a systematic review and meta-analysis
    Haroon, Shamil
    Jordan, Rachel
    Takwoingi, Yemisi
    Adab, Peymane
    [J]. BMJ OPEN, 2015, 5 (10):
  • [36] Diagnostic accuracy of echocardiography for pulmonary hypertension: a systematic review and meta-analysis
    Janda, Surinder
    Shahidi, Neal
    Gin, Kenneth
    Swiston, John
    [J]. HEART, 2011, 97 (08) : 612 - 622
  • [37] Accuracy of penicillin allergy diagnostic tests: A systematic review and meta-analysis
    Sousa-Pinto, Bernardo
    Tarrio, Isabel
    Blumenthal, Kimberly G.
    Araujo, Luis
    Azevedo, Luis Filipe
    Delgado, Luis
    Fonseca, Joao Almedia
    [J]. JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY, 2021, 147 (01) : 296 - 308
  • [38] The accuracy of screening instruments for sarcopenia: a diagnostic systematic review and meta-analysis
    Huang, Li
    Shu, Xiaoyu
    Ge, Ning
    Gao, Langli
    Xu, Ping
    Zhang, Yu
    Chen, Yu
    Yue, Jirong
    Wu, Chenkai
    [J]. AGE AND AGEING, 2023, 52 (08)
  • [39] The accuracy of diagnostic indicators for coeliac disease: A systematic review and meta-analysis
    Elwenspoek, Martha M. C.
    Jackson, Joni
    O'Donnell, Rachel
    Sinobas, Anthony
    Dawson, Sarah
    Everitt, Hazel
    Gillett, Peter
    Hay, Alastair D.
    Lane, Deborah L.
    Mallett, Susan
    Robins, Gerry
    Watson, Jessica C.
    Jones, Hayley E.
    Whiting, Penny
    [J]. PLOS ONE, 2021, 16 (10):
  • [40] Accuracy of cholera rapid diagnostic tests: a systematic review and meta-analysis
    Muzembo, Basilua Andre
    Kitahara, Kei
    Debnath, Anusuya
    Okamoto, Keinosuke
    Miyoshi, Shin-Ichi
    [J]. CLINICAL MICROBIOLOGY AND INFECTION, 2022, 28 (02) : 155 - 162