A novel staging system derived from natural language processing of pathology reports to predict prognostic outcomes of pancreatic cancer: a retrospective cohort study

被引:1
|
作者
Li, Bo [1 ,5 ]
Wang, Beilei [1 ,6 ]
Zhuang, Pengjie [8 ]
Cao, Hongwei [2 ]
Wu, Shengyong [3 ]
Tan, Zhendong [8 ]
Gao, Suizhi [1 ]
Li, Penghao [1 ]
Jing, Wei [1 ]
Shao, Zhuo [1 ]
Zheng, Kailian [1 ]
Wu, Lele [2 ]
Gao, Bai [2 ]
Wang, Yang [7 ]
Jiang, Hui [4 ]
Guo, Shiwei [1 ]
He, Liang [8 ,9 ,11 ]
Yang, Yan [8 ,9 ,11 ]
Jin, Gang [1 ,10 ]
机构
[1] Naval Mil Med Univ, Changhai Hosp, Dept Hepatobiliary Pancreat Surg, Shanghai, Peoples R China
[2] Naval Mil Med Univ, Changhai Hosp, Dept Informat, Shanghai, Peoples R China
[3] Naval Mil Med Univ, Changhai Hosp, Dept Mil Hlth Stat, Shanghai, Peoples R China
[4] Naval Mil Med Univ, Changhai Hosp, Dept Pathol, Shanghai, Peoples R China
[5] Navy Mil Med Univ, Naval Med Ctr, Dept Hepatobiliary Pancreat Surg, Shanghai, Peoples R China
[6] Navy Mil Med Univ, Naval Med Ctr, Dept Marine Biomed & Polar Med, Shanghai, Peoples R China
[7] Tongji Univ, Shanghai Peoples Hosp 4, Sch Med, Dept Pathol, Shanghai, Peoples R China
[8] East China Normal Univ, Sch Comp Sci & Technol, Dept Comp Sci & Technol, Shanghai, Peoples R China
[9] Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
[10] Naval Mil Med Univ, Changhai Hosp, 168 Changhai Rd, Shanghai 200433, Peoples R China
[11] East China Normal Univ, Sch Comp Sci & Technol, 3663 Zhongshan North Rd, Shanghai 200062, Peoples R China
关键词
natural language processing; pancreatic ductal adenocarcinoma; pathological report; prognosis; stratification;
D O I
10.1097/JS9.0000000000000648
中图分类号
R61 [外科手术学];
学科分类号
摘要
Objective: To construct a novel tumor-node-morphology (TNMor) staging system derived from natural language processing (NLP) of pathology reports to predict outcomes of pancreatic ductal adenocarcinoma.Method: This retrospective study with 1657 participants was based on a large referral center and The Cancer Genome Atlas Program (TCGA) dataset. In the training cohort, NLP was used to extract and screen prognostic predictors from pathology reports to develop the TNMor system, which was further evaluated with the tumor-node-metastasis (TNM) system in the internal and external validation cohort, respectively. Main outcomes were evaluated by the log-rank test of Kaplan-Meier curves, the concordance index (C-index), and the area under the receiver operating curve (AUC).Results: The precision, recall, and F1 scores of the NLP model were 88.83, 89.89, and 89.21%, respectively. In Kaplan-Meier analysis, survival differences between stages in the TNMor system were more significant than that in the TNM system. In addition, our system provided an improved C-index (internal validation, 0.58 vs. 0.54, P<0.001; external validation, 0.64 vs. 0.63, P<0.001), and higher AUCs for 1, 2, and 3-year survival (internal validation: 0.62 vs. 0.54, P<0.001; 0.64 vs. 0.60, P=0.017; 0.69 vs. 0.62, P=0.001; external validation: 0.69 vs. 0.65, P=0.098; 0.68 vs. 0.64, P=0.154; 0.64 vs. 0.55, P=0.032, respectively). Finally, our system was particularly beneficial for precise stratification of patients receiving adjuvant therapy, with an improved C-index (0.61 vs. 0.57, P<0.001), and higher AUCs for 1-year, 2-year, and 3-year survival (0.64 vs. 0.57, P<0.001; 0.64 vs. 0.58, P<0.001; 0.67 vs. 0.61, P<0.001; respectively) compared with the TNM system.Conclusion: These findings suggest that the TNMor system performed better than the TNM system in predicting pancreatic ductal adenocarcinoma prognosis. It is a promising system to screen risk-adjusted strategies for precision medicine.
引用
收藏
页码:3476 / 3489
页数:14
相关论文
共 46 条
  • [1] Automatic Lung Cancer Staging from Medical Reports Using Natural Language Processing
    Sui, X.
    Liu, T.
    Huang, Q.
    Hou, Y.
    Wang, Y.
    Kang, G.
    Guo, H.
    Li, N.
    Li, Y.
    Wang, Z.
    Wang, J.
    [J]. JOURNAL OF THORACIC ONCOLOGY, 2018, 13 (10) : S772 - S772
  • [2] Leveraging Natural Language Processing to Extract Features of Colorectal Polyps From Pathology Reports for Epidemiologic Study
    Benson, Ryzen
    Winterton, Candace
    Winn, Maci
    Krick, Benjamin
    Liu, Mei
    Abu-el-rub, Noor
    Conway, Mike
    Del Fiol, Guilherme
    Gawron, Andrew
    Hardikar, Sheetal
    [J]. JCO CLINICAL CANCER INFORMATICS, 2023, 7
  • [3] Leveraging Natural Language Processing to Extract Features of Colorectal Polyps From Pathology Reports for Epidemiologic Study
    Benson, Ryzen
    Winterton, Candace
    Winn, Maci
    Krick, Benjamin
    Liu, Mei
    Abu-el-Rub, Noor
    Conway, Mike
    Del Fiol, Guilherme
    Gawron, Andrew
    Hardikar, Sheetal
    [J]. JCO CLINICAL CANCER INFORMATICS, 2023, 7
  • [4] Development and Validation of a Natural Language Processing Algorithm for Extracting Clinical and Pathological Features of Breast Cancer From Pathology Reports
    Munzone, Elisabetta
    Marra, Antonio
    Comotto, Federico
    Guercio, Lorenzo
    Sangalli, Claudia Anna
    Lo Cascio, Martina
    Pagan, Eleonora
    Sangalli, Davide
    Bigoni, Ilaria
    Porta, Francesca Maria
    D'Ercole, Marianna
    Ritorti, Fabiana
    Bagnardi, Vincenzo
    Fusco, Nicola
    Curigliano, Giuseppe
    [J]. JCO CLINICAL CANCER INFORMATICS, 2024, 8
  • [5] A novel scoring system to predict postoperative mortality after colorectal cancer surgery: a retrospective cohort study
    Hartono, Anita
    Lesmana, Tomy
    [J]. BALI MEDICAL JOURNAL, 2022, 11 (01) : 96 - 102
  • [6] Automated medical chart review for breast cancer outcomes research: a novel natural language processing extraction system
    Yifu Chen
    Lucy Hao
    Vito Z. Zou
    Zsuzsanna Hollander
    Raymond T. Ng
    Kathryn V. Isaac
    [J]. BMC Medical Research Methodology, 22
  • [7] Automated medical chart review for breast cancer outcomes research: a novel natural language processing extraction system
    Chen, Yifu
    Hao, Lucy
    Zou, Vito Z.
    Hollander, Zsuzsanna
    Ng, Raymond T.
    Isaac, Kathryn, V
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2022, 22 (01)
  • [8] Democratizing access to research for patients with pancreatic cancer across a diverse health system through natural language processing of radiology reports.
    King, Daniel
    Valente, Cristina
    Barish, Matthew
    Carvino, Anthony
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 2023, 41 : 664 - 664
  • [9] Automated classification of cancer morphology from Italian pathology reports using Natural Language Processing techniques: A rule-based approach
    Lindaa, Hammami
    Alessia, Paglialonga
    Giancarlo, Pruneri
    Michele, Torresani
    Milenaa, Sant
    Carlo, Bono
    Gianluca, Caiani Enrico
    Paolo, Baili
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 116
  • [10] Natural language processing and network analysis in patients withdrawing from life-sustaining treatments: a retrospective cohort study
    Tsai, Wei-Chin
    Tsai, Yun-Cheng
    Kuo, Kuang-Cheng
    Cheng, Shao-Yi
    Tsai, Jaw-Shiun
    Chiu, Tai-Yuan
    Huang, Hsien-Liang
    [J]. BMC PALLIATIVE CARE, 2022, 21 (01)