Interactive and Iterative Annotation for Biomedical Entity Recognition

被引:10
|
作者
Yimam, Seid Muhie [1 ]
Biemann, Chris [1 ]
Majnaric, Ljiljana [2 ]
Sabanovic, Sefket [2 ]
Holzinger, Andreas [3 ,4 ]
机构
[1] Tech Univ Darmstadt, FG Language Technol, CS Dept, D-64289 Darmstadt, Germany
[2] Josip Juraj Strossmayer Univ Osijek, Fac Med, Osijek, Croatia
[3] Med Univ Graz, Res Unit HCI KDD, Inst Med Informat Stat & Documentat, A-8036 Graz, Austria
[4] CBmed Ctr Biomarker Res Med, A-8010 Graz, Austria
来源
关键词
Interactive annotation; Machine learning; Knowledge discovery; Data mining; Human in the loop; Biomedical entity recognition; INSULIN-RESISTANCE; OBESITY;
D O I
10.1007/978-3-319-23344-4_34
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we demonstrate the impact of interactive machine learning for the development of a biomedical entity recognition dataset using a human-into-the-loop approach: during annotation, a machine learning model is built on previous annotations and used to propose labels for subsequent annotation. To demonstrate that such interactive and iterative annotation speeds up the development of quality dataset annotation, we conduct two experiments. In the first experiment, we carry out an iterative annotation experimental simulation and show that only a handful of medical abstracts need to be annotated to produce suggestions that increase annotation speed. In the second experiment, clinical doctors have conducted a case study in annotating medical terms documents relevant for their research. The experiments validate our method qualitatively and quantitatively, and give rise to a more personalized, responsive information extraction technology.
引用
收藏
页码:347 / 357
页数:11
相关论文
共 50 条
  • [41] SVM-based biomedical name entity recognition
    School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
    Harbin Gongcheng Daxue Xuebao, 2006, SUPPL. (570-574):
  • [42] Comparison of named entity recognition methodologies in biomedical documents
    Hye-Jeong Song
    Byeong-Cheol Jo
    Chan-Young Park
    Jong-Dae Kim
    Yu-Seop Kim
    BioMedical Engineering OnLine, 17
  • [43] Classifier subset selection for biomedical named entity recognition
    Nazife Dimililer
    Ekrem Varoğlu
    Hakan Altınçay
    Applied Intelligence, 2009, 31 : 267 - 282
  • [44] Learning adaptive representations for entity recognition in the biomedical domain
    Lauriola, Ivano
    Aiolli, Fabio
    Lavelli, Alberto
    Rinaldi, Fabio
    JOURNAL OF BIOMEDICAL SEMANTICS, 2021, 12 (01)
  • [45] A Bidirectional Iterative Algorithm for Nested Named Entity Recognition
    Dadas, Slawomir
    Protasiewicz, Jaroslaw
    IEEE ACCESS, 2020, 8 (08): : 135091 - 135102
  • [46] Classifier subset selection for biomedical named entity recognition
    Dimililer, Nazife
    Varoglu, Ekrem
    Altincay, Hakan
    APPLIED INTELLIGENCE, 2009, 31 (03) : 267 - 282
  • [47] Learning adaptive representations for entity recognition in the biomedical domain
    Ivano Lauriola
    Fabio Aiolli
    Alberto Lavelli
    Fabio Rinaldi
    Journal of Biomedical Semantics, 12
  • [48] Towards reliable named entity recognition in the biomedical domain
    Giorgi, John M.
    Bader, Gary D.
    BIOINFORMATICS, 2020, 36 (01) : 280 - 286
  • [49] Entity recognition in the biomedical domain using a hybrid approach
    Basaldella, Marco
    Furrer, Lenz
    Tasso, Carlo
    Rinaldi, Fabio
    JOURNAL OF BIOMEDICAL SEMANTICS, 2017, 8
  • [50] Improving biomedical named entity recognition with syntactic information
    Tian, Yuanhe
    Shen, Wang
    Song, Yan
    Xia, Fei
    He, Min
    Li, Kenli
    BMC BIOINFORMATICS, 2020, 21 (01)