HunFlair: an easy-to-use tool for state-of-the-art biomedical named entity recognition

被引:45
|
作者
Weber, Leon [1 ,2 ]
Saenger, Mario [1 ]
Munchmeyer, Jannes [1 ,3 ]
Habibi, Maryam [1 ]
Leser, Ulf [1 ]
Akbik, Alan [1 ]
机构
[1] Humboldt Univ, Comp Sci Dept, D-10099 Berlin, Germany
[2] Helmholtz Assoc, Grp Math Modelling Cellular Proc, Max Delbruck Ctr Mol Med, D-13125 Berlin, Germany
[3] GFZ German Res Ctr Geosci, Sect Seismol, D-14473 Potsdam, Germany
关键词
D O I
10.1093/bioinformatics/btab042
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Named entity recognition (NER) is an important step in biomedical information extraction pipelines. Tools for NER should be easy to use, cover multiple entity types, be highly accurate and be robust toward variations in text genre and style. We present HunFlair, a NER tagger fulfilling these requirements. HunFlair is integrated into the widely used NLP framework Flair, recognizes five biomedical entity types, reaches or overcomes state-of-the-art performance on a wide set of evaluation corpora, and is trained in a cross-corpus setting to avoid corpus-specific bias. Technically, it uses a character-level language model pretrained on roughly 24 million biomedical abstracts and three million full texts. It outperforms other off-the-shelf biomedical NER tools with an average gain of 7.26 pp over the next best tool in a cross-corpus setting and achieves on-par results with state-of-the-art research prototypes in in-corpus experiments. HunFlair can be installed with a single command and is applied with only four lines of code. Furthermore, it is accompanied by harmonized versions of 23 biomedical NER corpora.
引用
下载
收藏
页码:2792 / 2794
页数:3
相关论文
共 50 条
  • [11] A review of biomedical named entity recognition
    Chang, Lu
    Zhang, Ruihuan
    Lv, Jia
    Zhou, Weiguang
    Bai, Yunli
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2022, 22 (03) : 893 - 900
  • [12] On the Use of Knowledge Transfer Techniques for Biomedical Named Entity Recognition
    Mehmood, Tahir
    Serina, Ivan
    Lavelli, Alberto
    Putelli, Luca
    Gerevini, Alfonso
    FUTURE INTERNET, 2023, 15 (02):
  • [13] Biomedical named entity recognition system
    Patrick, J. (jonpat@it.usyd.edu.au), 2005, School of Information Technologies
  • [14] An easy-to-use evaluation framework for benchmarking entity recognition and disambiguation systems
    Chen, Hui
    Wei, Bao-gang
    Li, Yi-ming
    Liu, Yong-huai
    Zhu, Wen-hao
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2017, 18 (02) : 195 - 205
  • [15] An easy-to-use evaluation framework for benchmarking entity recognition and disambiguation systems
    Hui Chen
    Bao-gang Wei
    Yi-ming Li
    Yong-huai Liu
    Wen-hao Zhu
    Frontiers of Information Technology & Electronic Engineering, 2017, 18 : 195 - 205
  • [16] Biomedical Named Entity Recognition Based on MCBERT
    Wang, Sai
    Yilahun, Hankiz
    Hamdulla, Askar
    2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 247 - 252
  • [17] Named Entity Recognition for Tamil Biomedical Documents
    Antony, Betina J.
    Mahalakshmi, G. S.
    2014 IEEE INTERNATIONAL CONFERENCE ON CIRCUIT, POWER AND COMPUTING TECHNOLOGIES (ICCPCT-2014), 2014, : 1571 - 1577
  • [18] A Genetic Approach for Biomedical Named Entity Recognition
    Ekbal, Asif
    Saha, Sriparna
    Sikdar, Utpal Kumar
    Hasanuzzaman, Md
    22ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2010), PROCEEDINGS, VOL 2, 2010, : 354 - +
  • [19] Named Entity Recognition From Biomedical Data
    Refaat, Maged
    Rafea, Ahmed
    Gaballah, Nada
    2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 838 - 844
  • [20] A comparative study for biomedical named entity recognition
    Xu Wang
    Chen Yang
    Renchu Guan
    International Journal of Machine Learning and Cybernetics, 2018, 9 : 373 - 382