Multi-modality risk prediction of cardiovascular diseases for breast cancer cohort in the All of Us Research Program

被引:1
|
作者
Yang, Han [1 ]
Zhou, Sicheng [1 ]
Rao, Zexi [2 ]
Zhao, Chen [2 ]
Cui, Erjia [2 ]
Shenoy, Chetan [3 ]
Blaes, Anne H. [4 ]
Paidimukkala, Nishitha [1 ]
Wang, Jinhua [5 ]
Hou, Jue [2 ]
Zhang, Rui [6 ]
机构
[1] Univ Minnesota, Inst Hlth Informat, Minneapolis, MN 55455 USA
[2] Univ Minnesota, Sch Publ Hlth, Div Biostat & Hlth Data Sci, 2221 Univ Ave SE,Suite 200, Minneapolis, MN 55414 USA
[3] Univ Minnesota, Med Ctr, Dept Med, Cardiovasc Div, Minneapolis, MN 55455 USA
[4] Univ Minnesota, Div Hematol Oncol & Transplantat, Minneapolis, MN 55455 USA
[5] Univ Minnesota, Masonic Canc Ctr, Minneapolis, MN 55455 USA
[6] Univ Minnesota, Dept Surg, Div Comp Hlth Sci, 308 Harvard St SE, Minneapolis, MN 55455 USA
基金
美国国家卫生研究院;
关键词
cardiovascular disease; breast cancer; predictive model; All of Us; SOCIAL DETERMINANTS; SURVIVAL; MODELS; TIME; ASSOCIATIONS; STATEMENT; SELECTION; IMPACT; INDEX; LASSO;
D O I
10.1093/jamia/ocae199
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective This study leverages the rich diversity of the All of Us Research Program (All of Us)'s dataset to devise a predictive model for cardiovascular disease (CVD) in breast cancer (BC) survivors. Central to this endeavor is the creation of a robust data integration pipeline that synthesizes electronic health records (EHRs), patient surveys, and genomic data, while upholding fairness across demographic variables.Materials and Methods We have developed a universal data wrangling pipeline to process and merge heterogeneous data sources of the All of Us dataset, address missingness and variance in data, and align disparate data modalities into a coherent framework for analysis. Utilizing a composite feature set including EHR, lifestyle, and social determinants of health (SDoH) data, we then employed Adaptive Lasso and Random Forest regression models to predict 6 CVD outcomes. The models were evaluated using the c-index and time-dependent Area Under the Receiver Operating Characteristic Curve over a 10-year period.Results The Adaptive Lasso model showed consistent performance across most CVD outcomes, while the Random Forest model excelled particularly in predicting outcomes like transient ischemic attack when incorporating the full multi-model feature set. Feature importance analysis revealed age and previous coronary events as dominant predictors across CVD outcomes, with SDoH clustering labels highlighting the nuanced impact of social factors.Discussion The development of both Cox-based predictive model and Random Forest Regression model represents the extensive application of the All of Us, in integrating EHR and patient surveys to enhance precision medicine. And the inclusion of SDoH clustering labels revealed the significant impact of sociobehavioral factors on patient outcomes, emphasizing the importance of comprehensive health determinants in predictive models. Despite these advancements, limitations include the exclusion of genetic data, broad categorization of CVD conditions, and the need for fairness analyses to ensure equitable model performance across diverse populations. Future work should refine clinical and social variable measurements, incorporate advanced imputation techniques, and explore additional predictive algorithms to enhance model precision and fairness.Conclusion This study demonstrates the liability of the All of Us's diverse dataset in developing a multi-modality predictive model for CVD in BC survivors risk stratification in oncological survivorship. The data integration pipeline and subsequent predictive models establish a methodological foundation for future research into personalized healthcare.
引用
收藏
页码:2800 / 2810
页数:11
相关论文
共 50 条
  • [1] Psychosocial impact of a multi-modality surveillance program for women at high-risk for breast cancer
    Amico, Andrea L.
    Fang, Raymond
    Raoul, Akila
    Wroblewski, Kristen
    Nielsen, Sarah
    Weipert, Caroline
    Abe, Hiroyuki
    Sheth, Deepa
    Romero, Iris
    Kulkarni, Kirti
    Schacht, David
    Patrick-Miller, Linda
    Verp, Marion
    Bradbury, Angela R.
    Hlubocky, Fay
    Olopade, Olufunmilayo I.
    CANCER RESEARCH, 2018, 78 (04)
  • [2] Multi-modality imaging of a murine mammary window chamber for breast cancer research
    Schafer, Rachel
    Leung, Hui Min
    Gmitro, Arthur F.
    BIOTECHNIQUES, 2014, 57 (01) : 45 - 50
  • [3] MULTI-MODALITY MOLECULAR IMAGING FOR GASTRIC CANCER RESEARCH
    Liang, Jimin
    Chen, Xueli
    Liu, Junting
    Hu, Hao
    Qu, Xiaochao
    Wang, Fu
    Nie, Yongzhan
    2011 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE AND EXHIBITION (ACP), 2012,
  • [4] Multi-Modality Imaging in the Assessment of Cardiovascular Toxicity in the Cancer Patient
    Plana, Juan Carlos
    Thavendiranathan, Paaladinesh
    Bucciarelli-Ducci, Chiara
    Lancellotti, Patrizio
    JACC-CARDIOVASCULAR IMAGING, 2018, 11 (08) : 1173 - 1186
  • [5] MULTI-MODALITY MOLECULAR IMAGING FOR GASTRIC CANCER RESEARCH
    Liang, Jimin
    Chen, Xueli
    Liu, Junting
    Hu, Hao
    Qu, Xiaochao
    Wang, Fu
    Nie, Yongzhan
    OPTICAL SENSORS AND BIOPHOTONICS III, 2011, 8311
  • [6] Air quality and cancer risk in the All of Us Research Program
    Craver, Andrew
    Luo, Jiajun
    Kibriya, Muhammad G.
    Randorf, Nina
    Bahl, Kendall
    Connellan, Elizabeth
    Powell, Johnny
    Zakin, Paul
    Jones, Rena R.
    Argos, Maria
    Ho, Joyce
    Kim, Karen
    Daviglus, Martha L.
    Greenland, Philip
    Ahsan, Habibul
    Aschebrook-Kilfoy, Briseis
    CANCER CAUSES & CONTROL, 2024, 35 (05) : 749 - 760
  • [7] Air quality and cancer risk in the All of Us Research Program
    Andrew Craver
    Jiajun Luo
    Muhammad G. Kibriya
    Nina Randorf
    Kendall Bahl
    Elizabeth Connellan
    Johnny Powell
    Paul Zakin
    Rena R. Jones
    Maria Argos
    Joyce Ho
    Karen Kim
    Martha L. Daviglus
    Philip Greenland
    Habibul Ahsan
    Briseis Aschebrook-Kilfoy
    Cancer Causes & Control, 2024, 35 : 749 - 760
  • [8] A Novel Multi-modality Image-guided US-NIR Scanner for Breast Cancer Diagnosis
    Winey, B. A.
    Liao, L.
    Zhang, Y.
    Helbig, J.
    Misic, V.
    Parker, K.
    Podder, T.
    Yu, Y.
    WORLD CONGRESS ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING 2006, VOL 14, PTS 1-6, 2007, 14 : 1396 - +
  • [9] A review of optical breast imaging: Multi-modality systems for breast cancer diagnosis
    Zhu, Quing
    Poplack, Steven
    EUROPEAN JOURNAL OF RADIOLOGY, 2020, 129
  • [10] Preoperative prediction of perineural invasion with multi-modality radiomics in rectal cancer
    Yu Guo
    Quan Wang
    Yan Guo
    Yiying Zhang
    Yu Fu
    Huimao Zhang
    Scientific Reports, 11