Pancreatic cancer symptom trajectories from Danish registry data and free text in electronic health records

被引:2
|
作者
Hjaltelin, Jessica Xin [1 ]
Novitski, Sif Ingibergsdottir [1 ]
Jorgensen, Isabella Friis [1 ]
Siggaard, Troels [1 ]
Vulpius, Siri Amalie [1 ]
Westergaard, David [1 ]
Johansen, Julia Sidenius [2 ]
Chen, Inna M. [2 ]
Juhl Jensen, Lars [1 ]
Brunak, Soren [1 ,3 ]
机构
[1] Univ Copenhagen, Novo Nord Fdn Ctr Prot Res, Fac Hlth & Med Sci, Copenhagen, Denmark
[2] Copenhagen Univ Hosp Herlev & Gentofte, Dept Oncol, Herlev, Denmark
[3] Copenhagen Univ Hosp, Rigshosp, Copenhagen, Denmark
来源
ELIFE | 2023年 / 12卷
关键词
pancreas cancer; symptomology; disease progression; longitudinal analysis; patient stratification; Human; PRIMARY-CARE; RISK; DIAGNOSIS; DISEASE; IDENTIFICATION; TIME;
D O I
10.7554/eLife.84919
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Pancreatic cancer is one of the deadliest cancer types with poor treatment options. Better detection of early symptoms and relevant disease correlations could improve pancreatic cancer prognosis. In this retrospective study, we used symptom and disease codes (ICD-10) from the Danish National Patient Registry (NPR) encompassing 6.9 million patients from 1994 to 2018,, of whom 23,592 were diagnosed with pancreatic cancer. The Danish cancer registry included 18,523 of these patients. To complement and compare the registry diagnosis codes with deeper clinical data, we used a text mining approach to extract symptoms from free text clinical notes in electronic health records (3078 pancreatic cancer patients and 30,780 controls). We used both data sources to generate and compare symptom disease trajectories to uncover temporal patterns of symptoms prior to pancreatic cancer diagnosis for the same patients. We show that the text mining of the clinical notes was able to complement the registry-based symptoms by capturing more symptoms prior to pancreatic cancer diagnosis. For example, 'Blood pressure reading without diagnosis', 'Abnormalities of heartbeat', and 'Intestinal obstruction' were not found for the registry-based analysis. Chaining symptoms together in trajectories identified two groups of patients with lower median survival (<90 days) following the trajectories 'Cough -> Jaundice -> Intestinal obstruction' and 'Pain -> Jaundice -> Abnormal results of function studies'. These results provide a comprehensive comparison of the two types of pancreatic cancer symptom trajectories, which in combination can leverage the full potential of the health data and ultimately provide a fuller picture for detection of early risk factors for pancreatic cancer.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Analysis of free text in electronic health records for identification of cancer patient trajectories
    Jensen, Kasper
    Soguero-Ruiz, Cristina
    Mikalsen, Karl Oyvind
    Lindsetmo, Rolv-Ole
    Kouskoumvekaki, Irene
    Girolami, Mark
    Skrovseth, Stein Olav
    Augestad, Knut Magne
    SCIENTIFIC REPORTS, 2017, 7
  • [2] Analysis of free text in electronic health records for identification of cancer patient trajectories
    Kasper Jensen
    Cristina Soguero-Ruiz
    Karl Oyvind Mikalsen
    Rolv-Ole Lindsetmo
    Irene Kouskoumvekaki
    Mark Girolami
    Stein Olav Skrovseth
    Knut Magne Augestad
    Scientific Reports, 7
  • [3] Extracting Standardized Visual Acuity Data from Free-Text Electronic Health Records: The 'visualacuity' Toolkit
    Gale, Robert C.
    Halfpenny, Will
    Bedrick, Steven
    Hribar, Michelle
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2024, 65 (07)
  • [4] Using anchors from free text in electronic health records to diagnose postoperative delirium
    Mikalsen, Karl Oyvind
    Soguero-Ruiz, Cristina
    Jensen, Kasper
    Hindberg, Kristian
    Gran, Mads
    Revhaug, Arthur
    Lindsetmo, Rolv-Ole
    Skrovseth, Stein Olav
    Godtliebsen, Fred
    Jenssen, Robert
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2017, 152 : 105 - 114
  • [5] Integrating a data curation artificial intelligence system to identify cancer registry data elements from unstructured electronic health records
    Yang, Y-h.
    Dai, H-j.
    Tsai, J-h.
    Chang, Y-c.
    Wang, T-y.
    Ke, C-r.
    Yu, S-j.
    Liu, Y-h.
    Huang, C-j.
    Lee, C-j.
    Lee, Y-h.
    Liang, J-r.
    Fu, W-m.
    Liao, S-c.
    Kuo, S-j.
    Huang, C-y.
    Lin, L-j.
    Wu, C-c.
    Chang, G-c.
    Chong, I-w.
    ANNALS OF ONCOLOGY, 2024, 35
  • [6] Integrating a data curation artificial intelligence system to identify cancer registry data elements from unstructured electronic health records
    Yang, Y-H.
    Dai, H-J.
    Tsai, J-H.
    Chang, Y-C.
    Wang, T-Y.
    Ke, C-R.
    Yu, S-J.
    Liu, Y-H.
    Huang, C-J.
    Lee, C-J.
    Lee, Y-H.
    Liang, J-R.
    Fu, W-M.
    Liao, S-C.
    Kuo, S-J.
    Huang, C-Y.
    Lin, L-J.
    Wu, C-C.
    Chang, G-C.
    Chong, I-W.
    ANNALS OF ONCOLOGY, 2024, 35 : S1678 - S1678
  • [7] De-identifying free text of Japanese electronic health records
    Kohei Kajiyama
    Hiromasa Horiguchi
    Takashi Okumura
    Mizuki Morita
    Yoshinobu Kano
    Journal of Biomedical Semantics, 11
  • [8] Automated Classification of Free Text Electronic Health Records for Epidemiological Studies
    Schuemie, Martijn J.
    Sen, Emine
    van Soest, Eva M.
    Sturkenboom, Miriam C.
    Kors, Jan A.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2010, 19 : S106 - S106
  • [9] De-identifying free text of Japanese electronic health records
    Kajiyama, Kohei
    Horiguchi, Hiromasa
    Okumura, Takashi
    Morita, Mizuki
    Kano, Yoshinobu
    JOURNAL OF BIOMEDICAL SEMANTICS, 2020, 11 (01)
  • [10] From free-text electronic health records to structured cohorts: Onconum, an innovative methodology for real-world data mining in breast cancer
    Simoulin, Antoine
    Thiebaut, Nicolas
    Neuberger, Karl
    Ibnouhsein, Issam
    Brunel, Nicolas
    Vine, Raphael
    Bousquet, Nicolas
    Latapy, Jules
    Reix, Nathalie
    Moliere, Sebastien
    Lodi, Massimo
    Mathelin, Carole
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 240