Artificial intelligence methods applied to longitudinal data from electronic health records for prediction of cancer: a scoping review

被引:1
|
作者
Moglia, Victoria [1 ]
Johnson, Owen [1 ]
Cook, Gordon [2 ,3 ]
de Kamps, Marc [1 ]
Smith, Lesley [2 ]
机构
[1] Univ Leeds, Sch Comp, Woodhouse Lane, Leeds LS2 9JT, England
[2] Univ Leeds, Leeds Inst Clin Trials Res, Clarendon Way, Leeds LS2 9NL, England
[3] NIHR Leeds Biomed Res Ctr, Chapeltown Rd, Leeds LS7 4SA, England
基金
英国科研创新办公室;
关键词
Machine learning; Health data; Longitudinal data; Cancer; Time-series; Temporal; Artificial intelligence; DEEP LEARNING ALGORITHM; COLORECTAL-CANCER; PANCREATIC-CANCER; RISK PREDICTION; EARLY-DIAGNOSIS; TIME; MODELS;
D O I
10.1186/s12874-025-02473-w
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
BackgroundEarly detection and diagnosis of cancer are vital to improving outcomes for patients. Artificial intelligence (AI) models have shown promise in the early detection and diagnosis of cancer, but there is limited evidence on methods that fully exploit the longitudinal data stored within electronic health records (EHRs). This review aims to summarise methods currently utilised for prediction of cancer from longitudinal data and provides recommendations on how such models should be developed.MethodsThe review was conducted following PRISMA-ScR guidance. Six databases (MEDLINE, EMBASE, Web of Science, IEEE Xplore, PubMed and SCOPUS) were searched for relevant records published before 2/2/2024. Search terms related to the concepts "artificial intelligence", "prediction", "health records", "longitudinal", and "cancer". Data were extracted relating to several areas of the articles: (1) publication details, (2) study characteristics, (3) input data, (4) model characteristics, (4) reproducibility, and (5) quality assessment using the PROBAST tool. Models were evaluated against a framework for terminology relating to reporting of cancer detection and risk prediction models.ResultsOf 653 records screened, 33 were included in the review; 10 predicted risk of cancer, 18 performed either cancer detection or early detection, 4 predicted recurrence, and 1 predicted metastasis. The most common cancers predicted in the studies were colorectal (n = 9) and pancreatic cancer (n = 9). 16 studies used feature engineering to represent temporal data, with the most common features representing trends. 18 used deep learning models which take a direct sequential input, most commonly recurrent neural networks, but also including convolutional neural networks and transformers. Prediction windows and lead times varied greatly between studies, even for models predicting the same cancer. High risk of bias was found in 90% of the studies. This risk was often introduced due to inappropriate study design (n = 26) and sample size (n = 26).ConclusionThis review highlights the breadth of approaches to cancer prediction from longitudinal data. We identify areas where reporting of methods could be improved, particularly regarding where in a patients' trajectory the model is applied. The review shows opportunities for further work, including comparison of these approaches and their applications in other cancers.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review
    Carrasco-Ribelles, Lucia A.
    Llanes-Jurado, Jose
    Gallego-Moll, Carlos
    Cabrera-Bean, Margarita
    Monteagudo-Zaragoza, Monica
    Violan, Concepcion
    Zabaleta-del-Olmo, Edurne
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2023, 30 (12) : 2072 - 2082
  • [2] Artificial intelligence-based methods for fusion of electronic health records and imaging data
    Mohsen, Farida
    Ali, Hazrat
    El Hajj, Nady
    Shah, Zubair
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [3] Artificial intelligence-based methods for fusion of electronic health records and imaging data
    Farida Mohsen
    Hazrat Ali
    Nady El Hajj
    Zubair Shah
    Scientific Reports, 12
  • [4] Shareable artificial intelligence to extract cancer outcomes from electronic health records
    Kehl, Kenneth L.
    Jee, Justin
    Pichotta, Karl
    Trukhanov, Pavel
    Fong, Christopher
    Waters, Michele
    Nichols, Chelsea
    Cerami, Ethan
    Schrag, Deb
    Schultz, Nikolaus
    JOURNAL OF CLINICAL ONCOLOGY, 2024, 42 (16)
  • [5] Harnessing Electronic Health Records and Artificial Intelligence for Enhanced Cardiovascular Risk Prediction: A Comprehensive Review
    Tsai, Ming-Lung
    Chen, Kuan-Fu
    Chen, Pei-Chun
    JOURNAL OF THE AMERICAN HEART ASSOCIATION, 2025, 14 (06):
  • [6] Prediction of Brain Metastases Development in Patients With Lung Cancer by Explainable Artificial Intelligence From Electronic Health Records
    Li, Zhao
    Li, Rongbin
    Zhou, Yujia
    Rasmy, Laila
    Zhi, Degui
    Zhu, Ping
    Dono, Antonio
    Jiang, Xiaoqian
    Xu, Hua
    Esquenazi, Yoshua
    Zheng, W. Jim
    JCO CLINICAL CANCER INFORMATICS, 2023, 7
  • [7] Prediction of Brain Metastases Development in Patients With Lung Cancer by Explainable Artificial Intelligence From Electronic Health Records
    Li, Zhao
    Li, Rongbin
    Zhou, Yujia
    Rasmy, Laila
    Zhi, Degui
    Zhu, Ping
    Dono, Antonio
    Jiang, Xiaoqian
    Xu, Hua
    Esquenazi, Yoshua
    Zheng, Jim
    JCO CLINICAL CANCER INFORMATICS, 2023, 7 : e2200141
  • [8] Major areas of interest of artificial intelligence research applied to health care administrative data: a scoping review
    Bukhtiyarova, Olga
    Abderrazak, Amna
    Chiu, Yohann
    Sparano, Stephanie
    Simard, Marc
    Sirois, Caroline
    FRONTIERS IN PHARMACOLOGY, 2022, 13
  • [9] Research and Application of Artificial Intelligence Based on Electronic Health Records of Patients With Cancer: Systematic Review
    Yang, Xinyu
    Mu, Dongmei
    Peng, Hao
    Li, Hua
    Wang, Ying
    Wang, Ping
    Wang, Yue
    Han, Siqi
    JMIR MEDICAL INFORMATICS, 2022, 10 (04) : 36 - 46
  • [10] Integrating a data curation artificial intelligence system to identify cancer registry data elements from unstructured electronic health records
    Yang, Y-H.
    Dai, H-J.
    Tsai, J-H.
    Chang, Y-C.
    Wang, T-Y.
    Ke, C-R.
    Yu, S-J.
    Liu, Y-H.
    Huang, C-J.
    Lee, C-J.
    Lee, Y-H.
    Liang, J-R.
    Fu, W-M.
    Liao, S-C.
    Kuo, S-J.
    Huang, C-Y.
    Lin, L-J.
    Wu, C-C.
    Chang, G-C.
    Chong, I-W.
    ANNALS OF ONCOLOGY, 2024, 35 : S1678 - S1678