Customizable Natural Language Processing Biomarker Extraction Tool

Cited by: 4
Authors
Holmes, Benjamin [1 ]
Chitale, Dhananjay [2 ]
Loving, Joshua [1 ]
Tran, Mary [1 ]
Subramanian, Vinod [1 ]
Berry, Anna [1 ]
Rioth, Matthew [1 ]
Warrier, Raghu [1 ]
Brown, Thomas [1 ]
Affiliations
[1] Syapse Inc, 303 2nd St,Ste N500, San Francisco, CA 94107 USA
[2] Henry Ford Hlth Syst, Detroit, MI USA
DOI
10.1200/CCI.21.00017
Chinese Library Classification
R73 [Oncology]
Discipline classification code
100214
Abstract
PURPOSE: Natural language processing (NLP) in pathology reports to extract biomarker information is an ongoing area of research. MetaMap is a natural language processing tool developed and funded by the National Library of Medicine to map biomedical text to the Unified Medical Language System Metathesaurus by applying specific tags to clinically relevant terms. Although results are useful without additional postprocessing, these tags lack important contextual information.

METHODS: Our novel method takes terminology-driven semantic tags and incorporates those into a semantic frame that is task-specific to add necessary context to MetaMap. We use important contextual information to capture biomarker results to support Community Health System's use of Precision Medicine treatments for patients with cancer. For each biomarker, the name, type, numeric quantifiers, non-numeric qualifiers, and the time frame are extracted. These fields then associate biomarkers with their context in the pathology report such as test type, probe intensity, copy-number changes, and even failed results. A selection of 6,713 relevant reports contained the following standard-of-care biomarkers for metastatic breast cancer: breast cancer gene 1 and 2, estrogen receptor, progesterone receptor, human epidermal growth factor receptor 2, and programmed death-ligand 1.

RESULTS: The method was tested on pathology reports from the internal pathology laboratory at Henry Ford Health System. A certified tumor registrar reviewed 400 tests, which showed >95% accuracy for all extracted biomarker types.

CONCLUSION: Using this new method, it is possible to extract high-quality, contextual biomarker information, and this represents a significant advance in biomarker extraction.

© 2021 by American Society of Clinical Oncology
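
The abstract does not give implementation details, so the following is only an illustrative sketch of what a task-specific semantic frame with the five listed fields (name, type, numeric quantifiers, non-numeric qualifiers, time frame) might look like. The `BiomarkerFrame` class, the `fill_frames` function, and the tag names (`BIOMARKER`, `TYPE`, `NUMERIC`, `QUALIFIER`, `DATE`) are all hypothetical; they are not MetaMap output and not the authors' code. The sketch assumes the simplest possible attachment rule: each context tag belongs to the most recently seen biomarker.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

# (text, tag) pairs. The tag names are made up for this sketch; real MetaMap
# output uses UMLS concepts and semantic types instead.
Tagged = Tuple[str, str]


@dataclass
class BiomarkerFrame:
    """One frame per biomarker mention, holding the fields named in the abstract."""
    name: str
    biomarker_type: Optional[str] = None
    quantifiers: List[str] = field(default_factory=list)  # numeric values
    qualifiers: List[str] = field(default_factory=list)   # non-numeric values
    time_frame: Optional[str] = None


def fill_frames(tagged: List[Tagged]) -> List[BiomarkerFrame]:
    """Attach each context tag to the most recently seen biomarker (assumed rule)."""
    frames: List[BiomarkerFrame] = []
    for text, tag in tagged:
        if tag == "BIOMARKER":
            frames.append(BiomarkerFrame(name=text))
        elif not frames:
            continue  # context seen before any biomarker is ignored here
        elif tag == "TYPE":
            frames[-1].biomarker_type = text
        elif tag == "NUMERIC":
            frames[-1].quantifiers.append(text)
        elif tag == "QUALIFIER":
            frames[-1].qualifiers.append(text)
        elif tag == "DATE":
            frames[-1].time_frame = text
    return frames


# Hand-tagged toy input standing in for a tagged pathology report sentence.
example: List[Tagged] = [
    ("HER2", "BIOMARKER"), ("protein expression", "TYPE"),
    ("3+", "NUMERIC"), ("positive", "QUALIFIER"),
    ("ER", "BIOMARKER"), ("90%", "NUMERIC"), ("positive", "QUALIFIER"),
]
frames = fill_frames(example)
for frame in frames:
    print(frame)
```

A real system would need a more careful attachment rule than "nearest preceding biomarker", for example to handle failed results or values that precede the biomarker name in the sentence.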
Pages: 833-841
Page count: 9