Collection of cancer stage data by classifying free-text medical reports

被引:54
|
作者
McCowan, Iain A.
Moore, Darren C.
Nguyen, Anthony N.
Bowman, Rayleen V.
Clarke, Belinda E.
Duhig, Edwina E.
Fry, Mary-Jane
机构
[1] CSIRO, E Hlth Res Ctr, Brisbane, Qld 4000, Australia
[2] Univ Queensland, Dept Med, Brisbane, Qld 4000, Australia
[3] Prince Charles Hosp, Dept Anat Pathol, Brisbane, Qld 4032, Australia
[4] Queensland Hlth, Queensland Canc Control Anal Team, Brisbane, Qld, Australia
关键词
D O I
10.1197/jamia.M2130
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cancer staging provides a basis for planning clinical management, but also allows for meaningful analysis of cancer outcomes and evaluation of cancer care services. Despite this, stage data in cancer registries is often incomplete, inaccurate, or simply not collected. This article describes a prototype software system (Cancer Stage Interpretation System, CSIS) that automatically extracts cancer staging information from medical reports. The system uses text classification techniques to train support vector machines (SVMs) to extract elements of stage listed in cancer staging guidelines. When processing new reports, CSIS identifies sentences relevant to the staging decision, and subsequently assigns the most likely stage. The system was developed using a database of staging data and pathology reports for 710 lung cancer patients, then validated in an independent set of 179 patients against pathologic stage assigned by two independent pathologists. CSIS achieved overall accuracy of 74% for tumor (T) staging and 87% for node (N) staging, and errors were observed to mirror disagreements between human experts.
引用
收藏
页码:736 / 745
页数:10
相关论文
共 50 条
  • [1] Classification of cancer stage from free-text histology reports
    McCowan, Iain
    Moore, Darren
    Fry, Mary-Jane
    2006 28TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-15, 2006, : 922 - +
  • [2] Transformation of free-text radiology reports into structured data
    Graf, Markus M.
    Bressem, Keno K.
    Adams, Lisa C.
    RADIOLOGIE, 2025,
  • [3] CANCER REPORTING FROM OCR FREE-TEXT PATHOLOGY REPORTS
    Zuccon, Guido
    Anthony Nguyen
    Bergheim, Anton
    Grayson, Narelle
    ASIA-PACIFIC JOURNAL OF CLINICAL ONCOLOGY, 2012, 8 : 327 - 328
  • [4] MONITORING FREE-TEXT DATA USING MEDICAL LANGUAGE PROCESSING
    ZINGMOND, D
    LENERT, LA
    COMPUTERS AND BIOMEDICAL RESEARCH, 1993, 26 (05): : 467 - 481
  • [5] Filtering free-text medical data based on machine learning
    Grechishcheva, Sofia
    Lenivtceva, Iuliia
    Kopanitsa, Georgy
    Panfilov, Dmitry
    10TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE IN COMPUTATIONAL SCIENCE (YSC2021), 2021, 193 : 82 - 91
  • [6] Automatic structuring of radiology free-text reports
    Taira, RK
    Soderland, SG
    Jakobovits, RM
    RADIOGRAPHICS, 2001, 21 (01) : 237 - 245
  • [7] Mining free-text medical records
    Heinze, DT
    Morsch, ML
    Holbrook, J
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2001, : 254 - 258
  • [8] Potential of ChatGPT and GPT-4 for Data Mining of Free-Text CT Reports on Lung Cancer
    Fink, Matthias A.
    Bischoff, Arved
    Fink, Christoph A.
    Moll, Martin
    Kroschke, Jonas
    Dulz, Luca
    Heussel, Claus Peter
    Kauczor, Hans-Ulrich
    Weber, Tim F.
    RADIOLOGY, 2023, 308 (03)
  • [9] RELATIONAL DATA-BASE MODELING OF FREE-TEXT MEDICAL NARRATIVE
    CHI, EC
    SAGER, N
    TICK, LJ
    LYMAN, MS
    MEDICAL INFORMATICS, 1983, 8 (03): : 209 - 223
  • [10] Improving Discrete Data Capture in Synoptic Reports With Optional Free-Text Modifiers
    Renshaw, Andrew A.
    Gould, Edwin W.
    JCO CLINICAL CANCER INFORMATICS, 2018, 2 : 1 - 6