Automatic Extraction of Cancer Characteristics from Free-Text Pathology Reports for Cancer Notifications

被引:12
|
作者
Anthony Nguyen [1 ]
Moore, Julie
Lawley, Michael [1 ]
Hansen, David [1 ]
Colquist, Shoni
机构
[1] CSIRO, ICT Ctr, Australian E Hlth Res Ctr, Brisbane, Qld, Australia
关键词
Automatic Data Processing; Data Mining; Disease Notification; Neoplasm; Systematised Nomenclature of Medicine; RETRIEVAL;
D O I
10.3233/978-1-60750-791-8-117
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Objective: To develop a system for the automatic classification of Cancer Registry notifications data from free-text pathology reports. Method: The underlying technology used for the extraction of cancer notification items is based on the symbolic rule-based classification methodology, whereby formal semantics are used to reason with the systematised nomenclature of medicine - clinical terms (SNOMED CT) concepts identified in the free text. Business rules for cancer notifications used by Cancer Registry coding staff were also incorporated with the aim to mimic Cancer Registry processes. Results: The system was developed on a corpus of 239 histology and cytology reports (with 60% notifiable reports), and then evaluated on an independent set of 300 reports (with 20% notifiable reports). Results show that the system can reliably classify notifiable reports with 96% and 100% specificity, and achieve an overall accuracy of 82% and 74% for classifying notification items from notifiable reports at a unit record level from the development and evaluation set, respectively. Conclusion: Cancer Registries collect a multitude of data that requires manual review, slowing down the flow of information. Extracting and providing an automatically coded cancer pathology notification for review can lessen the reliance on expert clinical staff, improving the efficiency and availability of cancer information.
引用
收藏
页码:117 / 124
页数:8
相关论文
共 50 条
  • [31] Creating and indexing teaching files from free-text patient reports
    Johnson, DB
    Chu, WW
    Dionisio, JD
    Taira, RK
    Kangarloo, H
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1999, : 814 - 818
  • [32] Potential of ChatGPT and GPT-4 for Data Mining of Free-Text CT Reports on Lung Cancer
    Fink, Matthias A.
    Bischoff, Arved
    Fink, Christoph A.
    Moll, Martin
    Kroschke, Jonas
    Dulz, Luca
    Heussel, Claus Peter
    Kauczor, Hans-Ulrich
    Weber, Tim F.
    RADIOLOGY, 2023, 308 (03)
  • [33] Deep learning for natural language processing of free-text pathology reports: a comparison of learning curves
    Senders, Joeky T.
    Cote, David J.
    Mehrtash, Alireza
    Wiemann, Robert
    Gormley, William B.
    Smith, Timothy R.
    Broekman, Marike L. D.
    Arnaout, Omar
    BMJ INNOVATIONS, 2020, 6 (04) : 192 - 198
  • [34] Automatic extraction of breast cancer information from clinical reports
    Bretschneider, Claudia
    Zillner, Sonja
    Hammon, Matthias
    Gass, Paul
    Sonntag, Daniel
    2017 IEEE 30TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2017, : 213 - 218
  • [35] Deep Learning to Classify Radiology Free-Text Reports
    Chen, Matthew C.
    Ball, Robyn L.
    Yang, Lingyao
    Moradzadeh, Nathaniel
    Chapman, Brian E.
    Larson, David B.
    Langlotz, Curtis P.
    Amrhein, Timothy J.
    Lungren, Matthew P.
    RADIOLOGY, 2018, 286 (03) : 845 - 852
  • [36] Transformation of free-text radiology reports into structured data
    Graf, Markus M.
    Bressem, Keno K.
    Adams, Lisa C.
    RADIOLOGIE, 2025,
  • [37] Automated Histologic Grading from Free-Text Pathology Reports using Graph-of-Words Features and Machine Learning
    Yoon, Hong-Jun
    Roberts, Larry
    Tourassi, Georgia
    2017 IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL & HEALTH INFORMATICS (BHI), 2017, : 369 - 372
  • [38] Structured Pathology Reporting for Cancer from Free Text: Lung Cancer Case Study
    Nguyen, Anthony
    Lawley, Michael
    Hansen, David
    Colquis, Shoni
    ELECTRONIC JOURNAL OF HEALTH INFORMATICS, 2012, 7 (01):
  • [39] Inverse Regression for Extraction of Tumor Site from Cancer Pathology Reports
    Dubey, Abhishek K.
    Yoon, Hong-Jun
    Tourassi, Georgia D.
    2019 IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL & HEALTH INFORMATICS (BHI), 2019,
  • [40] Hierarchical attention networks for information extraction from cancer pathology reports
    Gao, Shang
    Young, Michael T.
    Qiu, John X.
    Yoon, Hong-Jun
    Christian, James B.
    Fearn, Paul A.
    Tourassi, Georgia D.
    Ramanthan, Arvind
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2018, 25 (03) : 321 - 330