Customizable Natural Language Processing Biomarker Extraction Tool

被引:4
|
作者
Holmes, Benjamin [1 ]
Chitale, Dhananjay [2 ]
Loving, Joshua [1 ]
Tran, Mary [1 ]
Subramanian, Vinod [1 ]
Berry, Anna [1 ]
Rioth, Matthew [1 ]
Warrier, Raghu [1 ]
Brown, Thomas [1 ]
机构
[1] Syapse Inc, 303 2nd St,Ste N500, San Francisco, CA 94107 USA
[2] Henry Ford Hlth Syst, Detroit, MI USA
来源
关键词
D O I
10.1200/CCI.21.00017
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
PURPOSE Natural language processing (NLP) in pathology reports to extract biomarker information is an ongoing area of research. MetaMap is a natural language processing tool developed and funded by the National Library of Medicine to map biomedical text to the Unified Medical Language System Metathesaurus by applying specific tags to clinically relevant terms. Although results are useful without additional postprocessing, these tags lack important contextual information. METHODS Our novel method takes terminology-driven semantic tags and incorporates those into a semantic frame that is task-specific to add necessary context to MetaMap. We use important contextual information to capture biomarker results to support Community Health System's use of Precision Medicine treatments for patients with cancer. For each biomarker, the name, type, numeric quantifiers, non-numeric qualifiers, and the time frame are extracted. These fields then associate biomarkers with their context in the pathology report such as test type, probe intensity, copy-number changes, and even failed results. A selection of 6,713 relevant reports contained the following standard-of-care biomarkers for metastatic breast cancer: breast cancer gene 1 and 2, estrogen receptor, progesterone receptor, human epidermal growth factor receptor 2, and programmed death-ligand 1. RESULTS The method was tested on pathology reports from the internal pathology laboratory at Henry Ford Health System. A certified tumor registrar reviewed 400 tests, which showed > 95% accuracy for all extracted biomarker types. CONCLUSION Using this new method, it is possible to extract high-quality, contextual biomarker information, and this represents a significant advance in biomarker extraction. (C) 2021 by American Society of Clinical Oncology
引用
收藏
页码:833 / 841
页数:9
相关论文
共 50 条
  • [1] Data Extraction by Using Natural Language Processing Tool
    More, Sujata D.
    Madankar, Mangala S.
    Chandak, M. B.
    HELIX, 2018, 8 (05): : 3846 - 3848
  • [2] Language as a biomarker for psychosis: A natural language processing approach
    Corcoran, Cheryl M.
    Mittal, Vijay A.
    Bearden, Carrie E.
    Gur, Raquel E.
    Hitczenko, Kasia
    Bilgrami, Zarina
    Savic, Aleksandar
    Cecchi, Guillermo A.
    Wolff, Phillip
    SCHIZOPHRENIA RESEARCH, 2020, 226 : 158 - 166
  • [3] Knowledge extraction from natural language processing
    Sbattella, L. (licia.sbattella@polimi.it), 1600, Springer Verlag (7200 LNCS):
  • [4] Generating Customizable Natural Language Descriptions
    Costa, A.
    Paraboni, I
    IEEE LATIN AMERICA TRANSACTIONS, 2019, 17 (08) : 1252 - 1258
  • [5] Extraction of Breast Cancer Biomarker Data from Narrative Clinical Documents Using Natural Language Processing
    He, Jinghua
    Ouyang, Fangqian
    Eckert, George
    Martin, Joel
    Church, Abby
    Knapp, Kristina
    Dexter, Paul
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2017, 26 : 36 - 36
  • [6] A Natural Language Processing Tool for Large-Scale Data Extraction from Echocardiography Reports
    Nath, Chinmoy
    Albaghdadi, Mazen S.
    Jonnalagadda, Siddhartha R.
    PLOS ONE, 2016, 11 (04):
  • [7] A new natural language processing tool for case simulations
    Lehmann, CU
    Nguyen, B
    Kim, GR
    Johnson, KB
    Lehmann, HP
    PEDIATRICS, 1999, 104 (03) : 672 - 673
  • [8] Biomolecular Event Extraction using Natural Language Processing
    Bali, Manish
    Anandaraj, S. P.
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (05) : 601 - 612
  • [9] Natural Language Processing and Automatic Knowledge Extraction for Lexicography
    Krek, Simon
    INTERNATIONAL JOURNAL OF LEXICOGRAPHY, 2019, 32 (02) : 115 - 118
  • [10] Analyzing Discourse Processing Using a Simple Natural Language Processing Tool
    Crossley, Scott A.
    Allen, Laura K.
    Kyle, Kristopher
    McNamara, Danielle S.
    DISCOURSE PROCESSES, 2014, 51 (5-6) : 511 - 534