Enhancing patient representation learning with inferred family pedigrees improves disease risk prediction

Authors
Huang, Xiayuan [1]
Arora, Jatin [2]
Erzurumluoglu, Abdullah Mesut [2]
Stanhope, Stephen A. [3]
Lam, Daniel [4]
Khoueiry, Pierre
Jensen, Jan N.
Cai, James
Lawless, Nathan
Kriegl, Jan
Ding, Zhihao
de Jong, Johann [6, 7]
Zhao, Hongyu [1]
Wang, Zuoheng [1, 2, 5]
Affiliations
[1] Yale Univ, Sch Publ Hlth, Dept Biostat, New Haven, CT 06510 USA
[2] Boehringer Ingelheim Pharm GmbH & Co KG, Global Computat Biol & Digital Sci, Human Genet, D-88400 Biberach, Germany
[3] Boehringer Ingelheim GmbH & Co KG, Real World Data & Analyt, Global Med Affairs, Ridgefield, CT 06877 USA
[4] Boehringer Ingelheim Pharm GmbH & Co KG, CB CMDR, Global Computat Biol & Digital Sci, D-88400 Biberach, Germany
[5] Yale Univ, Sch Med, Dept Biomed Informat & Data Sci, New Haven, CT 06510 USA
[6] Boehringer Ingelheim Pharm GmbH & Co KG, Global Computat Biol & Digital Sci, Stat Modeling, D-88400 Biberach, Germany
[7] UCB Biosci GmbH, Adv Analyt Patient Solut, D-40789 Monheim, Germany
Keywords
electronic health records; patient modeling; disease risk prediction; graph attention networks; ULCERATIVE-COLITIS; HERITABILITY; HISTORY; RECORD
DOI
10.1093/jamia/ocae297
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Background: Machine learning and deep learning are powerful tools for analyzing electronic health records (EHRs) in healthcare research. Although family health history has been recognized as a major predictor for a wide spectrum of diseases, research has so far adopted a limited view of family relations, essentially treating patients as independent samples in the analysis.
Methods: To address this gap, we present ALIGATEHR, which models inferred family relations in a graph attention network augmented with an attention-based medical ontology representation, thus accounting for the complex influence of genetics, shared environmental exposures, and disease dependencies.
Results: Taking disease risk prediction as a use case, we demonstrate that explicitly modeling family relations significantly improves predictions across the disease spectrum. We then show how ALIGATEHR's attention mechanism, which links patients' disease risk to their relatives' clinical profiles, successfully captures genetic aspects of diseases using longitudinal EHR diagnosis data. Finally, we use ALIGATEHR to successfully distinguish the 2 main inflammatory bowel disease subtypes with highly shared risk factors and symptoms (Crohn's disease and ulcerative colitis).
Conclusion: Overall, our results highlight that family relations should not be overlooked in EHR research and illustrate ALIGATEHR's great potential for enhancing patient representation learning for predictive and interpretable modeling of EHRs.
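The abstract describes attention over a graph of inferred family relations. As a rough illustration only, the sketch below shows one generic graph-attention aggregation step over a toy pedigree. It is not ALIGATEHR's implementation: the function name `family_attention`, the inputs `features`, `relatives`, and `attn_vec`, and the toy values are all hypothetical, and the learned linear projection, multi-head attention, and medical ontology component are omitted.

```python
import math


def leaky_relu(x, slope=0.2):
    """LeakyReLU, the nonlinearity commonly used for graph-attention scores."""
    return x if x > 0 else slope * x


def family_attention(features, relatives, attn_vec):
    """One generic graph-attention aggregation step over a family graph.

    features:  dict patient_id -> list of floats (hypothetical patient embedding)
    relatives: dict patient_id -> list of relative ids (hypothetical pedigree edges)
    attn_vec:  list of floats of length 2 * dim (hypothetical attention parameters)
    Returns (updated embeddings, attention weights per patient).
    """
    updated, weights = {}, {}
    for i, h_i in features.items():
        # A self-loop lets patients without known relatives keep their own embedding.
        neighbours = [i] + [j for j in relatives.get(i, []) if j in features]
        # Unnormalised score for each (patient, relative) pair from the concatenated embeddings.
        scores = [
            leaky_relu(sum(a * x for a, x in zip(attn_vec, h_i + features[j])))
            for j in neighbours
        ]
        # Numerically stable softmax over the patient's neighbourhood.
        top = max(scores)
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        alpha = [e / total for e in exps]
        weights[i] = dict(zip(neighbours, alpha))
        # The new embedding is the attention-weighted mean of the neighbourhood.
        updated[i] = [
            sum(a * features[j][k] for a, j in zip(alpha, neighbours))
            for k in range(len(h_i))
        ]
    return updated, weights


# Toy pedigree: one child with two parents (illustrative values only).
features = {"child": [1.0, 0.0], "mother": [0.0, 1.0], "father": [1.0, 1.0]}
relatives = {"child": ["mother", "father"], "mother": ["child"], "father": ["child"]}
updated, weights = family_attention(features, relatives, [0.5, -0.5, 1.0, 0.25])
```

In a sketch like this, `weights["child"]` indicates how much each relative's profile contributes to the child's updated representation, which is the kind of quantity the abstract refers to when it links a patient's disease risk to relatives' clinical profiles.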
Pages: 435-446
Number of pages: 12