Automatic Extraction of Cancer Characteristics from Free-Text Pathology Reports for Cancer Notifications

被引:12
|
作者
Anthony Nguyen [1 ]
Moore, Julie
Lawley, Michael [1 ]
Hansen, David [1 ]
Colquist, Shoni
机构
[1] CSIRO, ICT Ctr, Australian E Hlth Res Ctr, Brisbane, Qld, Australia
关键词
Automatic Data Processing; Data Mining; Disease Notification; Neoplasm; Systematised Nomenclature of Medicine; RETRIEVAL;
D O I
10.3233/978-1-60750-791-8-117
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Objective: To develop a system for the automatic classification of Cancer Registry notifications data from free-text pathology reports. Method: The underlying technology used for the extraction of cancer notification items is based on the symbolic rule-based classification methodology, whereby formal semantics are used to reason with the systematised nomenclature of medicine - clinical terms (SNOMED CT) concepts identified in the free text. Business rules for cancer notifications used by Cancer Registry coding staff were also incorporated with the aim to mimic Cancer Registry processes. Results: The system was developed on a corpus of 239 histology and cytology reports (with 60% notifiable reports), and then evaluated on an independent set of 300 reports (with 20% notifiable reports). Results show that the system can reliably classify notifiable reports with 96% and 100% specificity, and achieve an overall accuracy of 82% and 74% for classifying notification items from notifiable reports at a unit record level from the development and evaluation set, respectively. Conclusion: Cancer Registries collect a multitude of data that requires manual review, slowing down the flow of information. Extracting and providing an automatically coded cancer pathology notification for review can lessen the reliance on expert clinical staff, improving the efficiency and availability of cancer information.
引用
收藏
页码:117 / 124
页数:8
相关论文
共 50 条
  • [21] Information extraction from free-text business documents
    Abramowicz, W
    Piskorski, J
    ISSUES AND TRENDS OF INFORMATION TECHNOLOGY MANAGEMENT IN CONTEMPORARY ORGANIZATIONS, VOLS 1 AND 2, 2002, : 626 - 630
  • [22] Automatic Scanning of Free-Text Entries
    Lamer, Antoine
    Marcilly, Romaric
    Jeanne, Mathieu
    Logier, Regis
    E-HEALTH - FOR CONTINUITY OF CARE, 2014, 205 : 1196 - 1196
  • [23] Rule-Based Information Extraction from Free-Text Pathology Reports Reveals Trends in South African Female Breast Cancer Molecular Subtypes and Ki67 Expression
    Achilonu, Okechinyere J.
    Singh, Elvira
    Nimako, Gideon
    Eijkemans, Rene M. J. C.
    Musenge, Eustasius
    BIOMED RESEARCH INTERNATIONAL, 2022, 2022
  • [24] Extracting Cancer Mortality Statistics from Free-text Death Certificates
    Koopman, Bevan
    Nguyen, Anthony
    Cossio, Danica
    Courage, Mary-Jane
    Francois, Gary
    ADCS'18: PROCEEDINGS OF THE 23RD AUSTRALASIAN DOCUMENT COMPUTING SYMPOSIUM, 2018,
  • [25] Automated extraction of information from free text of Spanish oncology pathology reports
    Mendoza-Urbano, Diana Marcela
    Garcia, Johan Felipe
    Moreno, Juan Sebastian
    Bravo-Ocana, Juan Carlos
    Riascos, Alvaro Jose
    Harvey, Angela Zambrano
    Prada, Sergio, I
    COLOMBIA MEDICA, 2023, 54 (01):
  • [26] A Natural Language Processing Pipeline of Chinese Free-Text Radiology Reports for Liver Cancer Diagnosis
    Liu, Honglei
    Xu, Yan
    Zhang, Zhiqiang
    Wang, Ni
    Huang, Yanqun
    Hu, Yanjun
    Yang, Zhenghan
    Jiang, Rui
    Chen, Hui
    IEEE ACCESS, 2020, 8 : 159110 - 159119
  • [27] Automatic intracranial abnormality detection and localization in head CT scans by learning from free-text reports
    Liu, Aohan
    Guo, Yuchen
    Lyu, Jinhao
    Xie, Jing
    Xu, Feng
    Lou, Xin
    Yong, Jun-hai
    Dai, Qionghai
    CELL REPORTS MEDICINE, 2023, 4 (09)
  • [28] Automatic Structured Reporting from Narrative Cancer Pathology Reports
    Ou, Ying
    Patrick, Jon
    ELECTRONIC JOURNAL OF HEALTH INFORMATICS, 2014, 8 (02):
  • [30] Multi-class classification of cancer stages from free-text histology reports using support vector machines
    Nguyen, Anthony
    Moore, Darren
    McCowan, Lain
    Courage, Mary-Jane
    2007 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-16, 2007, : 5140 - 5143