Automatic Extraction of Cancer Characteristics from Free-Text Pathology Reports for Cancer Notifications

Cited by: 12
Authors
Nguyen, Anthony [1]
Moore, Julie
Lawley, Michael [1]
Hansen, David [1]
Colquist, Shoni
Affiliations
[1] CSIRO, ICT Ctr, Australian E Hlth Res Ctr, Brisbane, Qld, Australia
Keywords
Automatic Data Processing; Data Mining; Disease Notification; Neoplasm; Systematised Nomenclature of Medicine; RETRIEVAL;
DOI
10.3233/978-1-60750-791-8-117
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Objective: To develop a system for the automatic classification of Cancer Registry notification data from free-text pathology reports. Method: Cancer notification items are extracted using a symbolic rule-based classification methodology, in which formal semantics are used to reason with the Systematised Nomenclature of Medicine - Clinical Terms (SNOMED CT) concepts identified in the free text. Business rules for cancer notifications used by Cancer Registry coding staff were also incorporated, with the aim of mimicking Cancer Registry processes. Results: The system was developed on a corpus of 239 histology and cytology reports (60% of them notifiable) and then evaluated on an independent set of 300 reports (20% of them notifiable). Results show that the system can reliably classify notifiable reports with 96% and 100% specificity, and achieve an overall accuracy of 82% and 74% for classifying notification items from notifiable reports at a unit record level on the development and evaluation sets, respectively. Conclusion: Cancer Registries collect a multitude of data that requires manual review, which slows the flow of information. Extracting and providing an automatically coded cancer pathology notification for review can lessen the reliance on expert clinical staff, improving the efficiency and availability of cancer information.
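As a rough illustration of the symbolic approach described in the abstract, the following minimal Python sketch decides whether a report is notifiable by checking whether any concept found in it is subsumed by a target concept in an is-a hierarchy. The hierarchy, the concept names, and the rule that any malignant neoplasm makes a report notifiable are assumptions made for illustration only; they are not taken from SNOMED CT or from the authors' system, and concept identification in the free text is assumed to have happened upstream.

```python
# Toy is-a hierarchy (child -> set of parents).
# Hypothetical concept names, NOT real SNOMED CT content.
IS_A = {
    "adenocarcinoma": {"malignant_neoplasm"},
    "squamous_cell_carcinoma": {"malignant_neoplasm"},
    "malignant_neoplasm": {"neoplasm"},
    "benign_naevus": {"benign_neoplasm"},
    "benign_neoplasm": {"neoplasm"},
}


def ancestors(concept):
    """Return every concept reachable from `concept` via is-a links."""
    seen = set()
    stack = [concept]
    while stack:
        for parent in IS_A.get(stack.pop(), ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def is_subsumed_by(concept, ancestor):
    """True if `concept` is `ancestor` or one of its descendants."""
    return concept == ancestor or ancestor in ancestors(concept)


def is_notifiable(report_concepts):
    """Hypothetical business rule: a report is notifiable if any
    identified concept is a kind of malignant neoplasm."""
    return any(
        is_subsumed_by(c, "malignant_neoplasm") for c in report_concepts
    )
```

Under these assumptions, `is_notifiable({"adenocarcinoma"})` is true and `is_notifiable({"benign_naevus"})` is false. A real system would layer further registry business rules on top of this kind of subsumption test.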
Pages: 117-124
Number of pages: 8
Related papers
50 in total
  • [1] CANCER REPORTING FROM OCR FREE-TEXT PATHOLOGY REPORTS
    Zuccon, Guido
    Nguyen, Anthony
    Bergheim, Anton
    Grayson, Narelle
    ASIA-PACIFIC JOURNAL OF CLINICAL ONCOLOGY, 2012, 8 : 327 - 328
  • [2] Automatic extraction of cancer registry reportable information from free-text pathology reports using multitask convolutional neural networks
    Alawad, Mohammed
    Gao, Shang
    Qiu, John X.
    Yoon, Hong Jun
    Christian, J. Blair
    Penberthy, Lynne
    Mumphrey, Brent
    Wu, Xiao-Cheng
    Coyle, Linda
    Tourassi, Georgia
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (01) : 89 - 98
  • [3] Machine Learning-Based Extraction of Breast Cancer Receptor Status From Bilingual Free-Text Pathology Reports
    Pironet, Antoine
    Poirel, Helene A.
    Tambuyzer, Tim
    De Schutter, Harlinde
    van Walle, Lien
    Mattheijssens, Joris
    Henau, Kris
    Van Eycken, Liesbet
    Van Damme, Nancy
    FRONTIERS IN DIGITAL HEALTH, 2021, 3
  • [4] Automated Classification of Free-text Pathology Reports for Registration of Incident Cases of Cancer
    Jouhet, V.
    Defossez, G.
    Burgun, A.
    le Beux, P.
    Levillain, P.
    Ingrand, P.
    Claveau, V.
    METHODS OF INFORMATION IN MEDICINE, 2012, 51 (03) : 242 - 251
  • [5] Classification of cancer stage from free-text histology reports
    McCowan, Iain
    Moore, Darren
    Fry, Mary-Jane
    2006 28TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-15, 2006, : 922 - +
  • [6] Symbolic rule-based classification of lung cancer stages from free-text pathology reports
    Nguyen, Anthony N.
    Lawley, Michael J.
    Hansen, David P.
    Bowman, Rayleen V.
    Clarke, Belinda E.
    Duhig, Edwina E.
    Colquist, Shoni
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (04) : 440 - 445
  • [7] Automatic structuring of radiology free-text reports
    Taira, RK
    Soderland, SG
    Jakobovits, RM
    RADIOGRAPHICS, 2001, 21 (01) : 237 - 245
  • [8] Automated Information Extraction from Free-Text EEG Reports
    Biswal, Siddharth
    Nip, Zarina
    Moura Junior, Valdcry
    Bianchi, Matt T.
    Rosenthal, Eric S.
    Westover, M. Brandon
    2015 37TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2015, : 6804 - 6807
  • [9] A Text Mining Approach in the Classification of Free-Text Cancer Pathology Reports from the South African National Health Laboratory Services
    Achilonu, Okechinyere J.
    Olago, Victor
    Singh, Elvira
    Eijkemans, Rene M. J. C.
    Nimako, Gideon
    Musenge, Eustasius
    INFORMATION, 2021, 12 (11)
  • [10] Automatic information extraction from childhood cancer pathology reports
    Yoon, Hong-Jun
    Peluso, Alina
    Durbin, Eric B.
    Wu, Xiao-Cheng
    Stroup, Antoinette
    Doherty, Jennifer
    Schwartz, Stephen
    Wiggins, Charles
    Coyle, Linda
    Penberthy, Lynne
    JAMIA OPEN, 2022, 5 (02)