Artificial intelligence methods applied to longitudinal data from electronic health records for prediction of cancer: a scoping review

Cited by: 1
Authors
Moglia, Victoria [1]
Johnson, Owen [1]
Cook, Gordon [2, 3]
de Kamps, Marc [1]
Smith, Lesley [2]
Affiliations
[1] Univ Leeds, Sch Comp, Woodhouse Lane, Leeds LS2 9JT, England
[2] Univ Leeds, Leeds Inst Clin Trials Res, Clarendon Way, Leeds LS2 9NL, England
[3] NIHR Leeds Biomed Res Ctr, Chapeltown Rd, Leeds LS7 4SA, England
Funding
UK Research and Innovation (UKRI);
Keywords
Machine learning; Health data; Longitudinal data; Cancer; Time-series; Temporal; Artificial intelligence; DEEP LEARNING ALGORITHM; COLORECTAL-CANCER; PANCREATIC-CANCER; RISK PREDICTION; EARLY-DIAGNOSIS; TIME; MODELS;
DOI
10.1186/s12874-025-02473-w
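Chinese Library Classification (CLC) number
R19 [Health care organization and services (health services administration)];
Discipline classification code
Abstract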
Background: Early detection and diagnosis of cancer are vital to improving outcomes for patients. Artificial intelligence (AI) models have shown promise in the early detection and diagnosis of cancer, but there is limited evidence on methods that fully exploit the longitudinal data stored within electronic health records (EHRs). This review aims to summarise methods currently utilised for prediction of cancer from longitudinal data and provides recommendations on how such models should be developed.
Methods: The review was conducted following PRISMA-ScR guidance. Six databases (MEDLINE, EMBASE, Web of Science, IEEE Xplore, PubMed and SCOPUS) were searched for relevant records published before 2/2/2024. Search terms related to the concepts "artificial intelligence", "prediction", "health records", "longitudinal", and "cancer". Data were extracted relating to several areas of the articles: (1) publication details, (2) study characteristics, (3) input data, (4) model characteristics, (5) reproducibility, and (6) quality assessment using the PROBAST tool. Models were evaluated against a framework for terminology relating to reporting of cancer detection and risk prediction models.
Results: Of 653 records screened, 33 were included in the review; 10 predicted risk of cancer, 18 performed either cancer detection or early detection, 4 predicted recurrence, and 1 predicted metastasis. The most common cancers predicted in the studies were colorectal (n = 9) and pancreatic cancer (n = 9). Sixteen studies used feature engineering to represent temporal data, with the most common features representing trends. Eighteen used deep learning models which take a direct sequential input, most commonly recurrent neural networks, but also including convolutional neural networks and transformers. Prediction windows and lead times varied greatly between studies, even for models predicting the same cancer. High risk of bias was found in 90% of the studies. This risk was often introduced due to inappropriate study design (n = 26) and sample size (n = 26).
Conclusion: This review highlights the breadth of approaches to cancer prediction from longitudinal data. We identify areas where reporting of methods could be improved, particularly regarding where in a patient's trajectory the model is applied. The review shows opportunities for further work, including comparison of these approaches and their applications in other cancers.
Pages: 17
Related papers
50 in total
  • [21] An artificial intelligence framework integrating longitudinal electronic health records with real-world data enables continuous pan-cancer prognostication
    Morin, Olivier
    Vallieres, Martin
    Braunstein, Steve
    Ginart, Jorge Barrios
    Upadhaya, Taman
    Woodruff, Henry C.
    Zwanenburg, Alex
    Chatterjee, Avishek
    Villanueva-Meyer, Javier E.
    Valdes, Gilmer
    Chen, William
    Hong, Julian C.
    Yom, Sue S.
    Solberg, Timothy D.
    Lock, Steffen
    Seuntjens, Jan
    Park, Catherine
    Lambin, Philippe
    NATURE CANCER, 2021, 2 (07) : 709 - +
  • [22] Early Detection of Pancreatic Cancer Applying Artificial Intelligence to Electronic Health Records
    Kenner, Barbara J.
    Abrams, Natalie D.
    Chari, Suresh T.
    Field, Bruce F.
    Goldberg, Ann E.
    Hoos, William A.
    Klimstra, David S.
    Rothschild, Laura J.
    Srivastava, Sudhir
    Young, Matthew R.
    Go, Vay Liang W.
    PANCREAS, 2021, 50 (07) : 916 - 922
  • [23] Primer on Artificial Intelligence Used in Electronic Health Records
    Harrington, Linda
    AACN ADVANCED CRITICAL CARE, 2022, 33 (02) : 130 - 133
  • [24] Patients Managing Their Medical Data in Personal Electronic Health Records: Scoping Review
    Damen, Debby J.
    Schoonman, Guus G.
    Maat, Barbara
    Habibovic, Mirela
    Krahmer, Emiel
    Pauws, Steffen
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (12)
  • [25] Prediction of Clinical Outcomes in Psychotic Disorders Using Artificial Intelligence Methods: A Scoping Review
    Tay, Jing Ling
    Htun, Kyawt Kyawt
    Sim, Kang
    BRAIN SCIENCES, 2024, 14 (09)
  • [26] The application of explainable artificial intelligence (XAI) in electronic health record research: A scoping review
    Caterson, Jessica
    Lewin, Alexandra
    Williamson, Elizabeth
    DIGITAL HEALTH, 2024, 10
  • [27] Screening of Key Transcripts from Expression Data Using Applied Artificial Intelligence for Cancer Prediction
    Pratap, Anju
    Hamada, Michiaki
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
  • [28] Digital signatures in electronic health records: a scoping review
    Felisberto, Mariano
    de Oliveira, Julia Meller Dias
    Mohr, Eduarda Talita Bramorski
    Celuppi, Ianka Cristina
    Zanotto, Wagner Luiz
    dos Santos, Ranieri Alves
    Scandolara, Daniel Henrique
    Fantonelli, Miliane dos Santos
    Cunha, Celio Luiz
    Hammes, Jades Fernando
    Wazlawick, Raul Sidnei
    Dalmarco, Eduardo Monguilhott
    HEALTH AND TECHNOLOGY, 2024, 14 (06) : 1083 - 1096
  • [29] Shareable artificial intelligence to extract cancer outcomes from electronic health records for precision oncology research
    Kehl, Kenneth L.
    Jee, Justin
    Pichotta, Karl
    Paul, Morgan A.
    Trukhanov, Pavel
    Fong, Christopher
    Waters, Michele
    Bakouny, Ziad
    Xu, Wenxin
    Choueiri, Toni K.
    Nichols, Chelsea
    Schrag, Deborah
    Schultz, Nikolaus
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [30] Explainable artificial intelligence models using real-world electronic health record data: a systematic scoping review
    Payrovnaziri, Seyedeh Neelufar
    Chen, Zhaoyi
    Rengifo-Moreno, Pablo
    Miller, Tim
    Bian, Jiang
    Chen, Jonathan H.
    Liu, Xiuwen
    He, Zhe
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (07) : 1173 - 1185