Web Interface of NER and RE with BERT for Biomedical Text Mining

被引:2
|
作者
Park, Yeon-Ji [1 ]
Lee, Min-a [1 ]
Yang, Geun-Je [1 ]
Park, Soo Jun [2 ]
Sohn, Chae-Bong [1 ]
机构
[1] Kwangwoon Univ, Dept Elect & Commun Engn, Seoul 01897, South Korea
[2] Elect & Telecommun Res Inst, Digital Biomed Res Div, Daejeon 34129, South Korea
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 08期
基金
新加坡国家研究基金会;
关键词
BERT; web service; natural language process; text mining; biomedical domain; named-entity recognition; relation extraction; fine-tuning model; NORMALIZATION; RECOGNITION;
D O I
10.3390/app13085163
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The BioBERT Named Entity Recognition (NER) model is a high-performance model designed to identify both known and unknown entities. It surpasses previous NER models utilized by text-mining tools, such as tmTool and ezTag, in effectively discovering novel entities. In previous studies, the Biomedical Entity Recognition and Multi-Type Normalization Tool (BERN) employed this model to identify words that represent specific names, discern the type of the word, and implement it on a web page to offer NER service. However, we aimed to offer a web service that includes Relation Extraction (RE), a task determining the relation between entity pairs within a sentence. First, just like BERN, we fine-tuned the BioBERT NER model within the biomedical domain to recognize new entities. We identified two categories: diseases and genes/proteins. Additionally, we fine-tuned the BioBERT RE model to determine the presence or absence of a relation between the identified gene-disease entity pairs. The NER and RE results are displayed on a web page using the Django web framework. NER results are presented in distinct colors, and RE results are visualized as graphs in NetworkX and Cytoscape, allowing users to interact with the graphs.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Biomedical Text NER Tagging Tool with Web Interface for Generating BERT-Based Fine-Tuning Dataset
    Park, Yeon-Ji
    Lee, Min-a
    Yang, Geun-Je
    Park, Soo Jun
    Sohn, Chae-Bong
    APPLIED SCIENCES-BASEL, 2022, 12 (23):
  • [2] OntoGene web services for biomedical text mining
    Rinaldi, Fabio
    Clematide, Simon
    Marques, Hernani
    Ellendorff, Tilia
    Romacker, Martin
    Rodriguez-Esteban, Raul
    BMC BIOINFORMATICS, 2014, 15
  • [3] OntoGene web services for biomedical text mining
    Fabio Rinaldi
    Simon Clematide
    Hernani Marques
    Tilia Ellendorff
    Martin Romacker
    Raul Rodriguez-Esteban
    BMC Bioinformatics, 15
  • [4] An Empirical Study of Multi-Task Learning on BERT for Biomedical Text Mining
    Peng, Yifan
    Chen, Qingyu
    Lu, Zhiyong
    19TH SIGBIOMED WORKSHOP ON BIOMEDICAL LANGUAGE PROCESSING (BIONLP 2020), 2020, : 205 - 214
  • [5] Dependency parsing of biomedical text with BERT
    Jenna Kanerva
    Filip Ginter
    Sampo Pyysalo
    BMC Bioinformatics, 21
  • [6] Dependency parsing of biomedical text with BERT
    Kanerva, Jenna
    Ginter, Filip
    Pyysalo, Sampo
    BMC BIOINFORMATICS, 2020, 21 (Suppl 23)
  • [7] Mondou: Interface with text data mining for Web search engine
    Kawano, H
    Hasegawa, T
    PROCEEDINGS OF THE THIRTY-FIRST HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, VOL V: MODELING TECHNOLOGIES AND INTELLIGENT SYSTEMS TRACK, 1998, : 275 - 283
  • [8] Text mining the biomedical literature
    Pertsemlidis, A
    BIOPHYSICAL JOURNAL, 2002, 82 (01) : 168A - 168A
  • [9] BioTextQuest: a web-based biomedical text mining suite for concept discovery
    Papanikolaou, Nikolas
    Pafilis, Evangelos
    Nikolaou, Stavros
    Ouzounis, Christos A.
    Iliopoulos, Ioannis
    Promponas, Vasilis J.
    BIOINFORMATICS, 2011, 27 (23) : 3327 - 3328
  • [10] @Note: A workbench for Biomedical Text Mining
    Lourenco, Analia
    Carreira, Rafael
    Carneiro, Sonia
    Maia, Paulo
    Glez-Pena, Daniel
    Fdez-Riverola, Florentino
    Ferreira, Eugenio C.
    Rocha, Isabel
    Rocha, Miguel
    JOURNAL OF BIOMEDICAL INFORMATICS, 2009, 42 (04) : 710 - 720