Development and multi-site external validation of a generalizable risk prediction model for bipolar disorder

Cited by: 1
Authors
Walsh, Colin G. [1]
Ripperger, Michael A. [1]
Hu, Yirui [2]
Sheu, Yi-han [3, 4, 5]
Lee, Hyunjoon [1]
Wilimitis, Drew [1]
Zheutlin, Amanda B. [3]
Rocha, Daniel [2]
Choi, Karmel W. [3]
Castro, Victor M. [3]
Kirchner, H. Lester [2]
Chabris, Christopher F. [2]
Davis, Lea K. [1]
Smoller, Jordan W. [3, 4, 5]
Affiliations
[1] Vanderbilt Univ Med Ctr Hlth Syst, Nashville, TN 37232 USA
[2] Geisinger Hlth Syst, Danville, PA USA
[3] Massachusetts Gen Brigham Hlth Syst, Boston, MA USA
[4] Massachusetts Gen Hosp, Ctr Precis Psychiat, Dept Psychiat, Boston, MA USA
[5] Massachusetts Gen Hosp, Ctr Genom Med, Psychiat & Neurodev Genet Unit, Boston, MA USA
Keywords
REGULARIZATION PATHS; UNTREATED ILLNESS; DURATION; HEALTH; MISDIAGNOSIS; SUICIDE; BIAS;
DOI
10.1038/s41398-023-02720-y
Chinese Library Classification number
R749 [Psychiatry];
Discipline classification code
100205;
Abstract
Bipolar disorder is a leading contributor to disability, premature mortality, and suicide. Early identification of risk for bipolar disorder using generalizable predictive models trained on diverse cohorts around the United States could improve targeted assessment of high-risk individuals, reduce misdiagnosis, and improve the allocation of limited mental health resources. This observational case-control study aimed to develop and validate generalizable predictive models of bipolar disorder as part of the multisite, multinational PsycheMERGE Network, using large, diverse biobanks with linked electronic health records (EHRs) from three academic medical centers: in the Northeast (Massachusetts General Brigham), the Mid-Atlantic (Geisinger), and the Mid-South (Vanderbilt University Medical Center). Predictive models were developed and validated with multiple algorithms at each study site: random forests, gradient boosting machines, penalized regression, and stacked ensemble learning algorithms combining them. Predictors were limited to widely available EHR-based features agnostic to a common data model, including demographics, diagnostic codes, and medications. The main study outcome was bipolar disorder diagnosis as defined by the International Cohort Collection for Bipolar Disorder, 2015. In total, the study included records for 3,529,569 patients, including 12,533 cases (0.3%) of bipolar disorder. After internal and external validation, algorithms demonstrated optimal performance in their respective development sites. The stacked ensemble achieved the best combination of overall discrimination (AUC = 0.82-0.87) and calibration performance, with positive predictive values above 5% in the highest risk quantiles at all three study sites. In conclusion, generalizable predictive models of risk for bipolar disorder can be feasibly developed across diverse sites to enable precision medicine. Comparison of a range of machine learning methods indicated that an ensemble approach provides the best performance overall but requires local retraining. These models will be disseminated via the PsycheMERGE Network website.
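The two headline metrics in the abstract, discrimination (AUC) and positive predictive value in the highest risk quantile, can be illustrated with a minimal sketch. This is not the authors' code: the function names `auc` and `ppv_top_fraction` and the toy scores and labels are hypothetical and chosen only to show how such numbers are computed from predicted risks and case/control labels.

```python
from typing import Sequence


def auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Probability that a random case outscores a random control (ties count 0.5)."""
    cases = [s for s, y in zip(scores, labels) if y == 1]
    controls = [s for s, y in zip(scores, labels) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one control")
    wins = 0.0
    for c in cases:
        for k in controls:
            if c > k:
                wins += 1.0
            elif c == k:
                wins += 0.5
    return wins / (len(cases) * len(controls))


def ppv_top_fraction(scores: Sequence[float], labels: Sequence[int],
                     fraction: float) -> float:
    """Share of true cases among the `fraction` of patients with the highest risk."""
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    n_top = max(1, int(len(scores) * fraction))
    # Rank patients from highest to lowest predicted risk and keep the top slice.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    return sum(y for _, y in ranked[:n_top]) / n_top


# Hypothetical toy data: ten patients, two of whom are cases (label 1).
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
toy_labels = [1, 0, 1, 0, 0, 0, 0, 0, 0, 0]

print("AUC:", auc(toy_scores, toy_labels))
print("PPV in top 20%:", ppv_top_fraction(toy_scores, toy_labels, 0.2))
```

With a case prevalence as low as the one reported in the abstract, PPV in the top risk quantile is a more clinically interpretable summary than AUC alone, which is presumably why the abstract reports both.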
Pages: 7
Source: [J]. Translational Psychiatry, 14