Iterative Named Entity Recognition with Conditional Random Fields

被引:3
|
作者
Alves-Pinto, Ana [1 ]
Demus, Christoph [2 ]
Spranger, Michael [2 ]
Labudde, Dirk [2 ]
Hobley, Eleanor [1 ]
机构
[1] Zent Stelle Informat Tech Sicherheitsbereich, D-81677 Munich, Germany
[2] Univ Appl Sci Mittweida, Fac Appl Comp Sci & Biosci, D-09648 Mittweida, Germany
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 01期
基金
加拿大健康研究院; 美国国家卫生研究院;
关键词
active learning; self-learning; text; annotation; language;
D O I
10.3390/app12010330
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Named entity recognition (NER) constitutes an important step in the processing of unstructured text content for the extraction of information as well as for the computer-supported analysis of large amounts of digital data via machine learning methods. However, NER often relies on domain-specific knowledge, being conducted manually in a time- and human-resource-intensive process. These can be reduced with statistical models performing NER automatically. The current work investigates whether Conditional Random Fields (CRF) can be efficiently trained for NER in German texts, by means of an iterative procedure combining self-learning with a manual annotation-active learning-component. The training dataset increases continuously with the iterative procedure. Whilst self-learning did not markedly improve the performance of the CRF for NER, the manual annotation of sentences with the lowest probability of correct prediction clearly improved the model F1-score and simultaneously reduced the amount of manual annotation required to train the model. A model with an F1-score of 0.885 was able to be trained in 11.4 h.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Named Entity Recognition using Conditional Random Fields
    Patil, Nita
    Patil, Ajay
    Pawar, B., V
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 1181 - 1188
  • [2] Named Entity Recognition Using Conditional Random Fields
    Khan, Wahab
    Daud, Ali
    Shahzad, Khurram
    Amjad, Tehmina
    Banjar, Ameen
    Fasihuddin, Heba
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (13):
  • [3] Named entity recognition based on conditional random fields
    Song, Shengli
    Zhang, Nan
    Huang, Haitao
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 3): : S5195 - S5206
  • [4] Named entity recognition based on conditional random fields
    Shengli Song
    Nan Zhang
    Haitao Huang
    [J]. Cluster Computing, 2019, 22 : 5195 - 5206
  • [5] A tool for the named entity recognition using conditional random fields
    do Amaral, Daniela Oliveira F.
    Vieira, Renata
    [J]. LINGUAMATICA, 2014, 6 (01): : 41 - 49
  • [6] Thai Named Entity Recognition Based on Conditional Random Fields
    Tirasaroj, Nutcha
    Aroonmanakun, Wirote
    [J]. 2009 EIGHTH INTERNATIONAL SYMPOSIUM ON NATURAL LANGUAGE PROCESSING, PROCEEDINGS, 2009, : 216 - 220
  • [7] Conditional Random Fields based Named Entity Recognition for Sinhala
    Senevirathne, K. U.
    Attanayake, N. S.
    Dhananjanie, A. W. M. H.
    Weragoda, W. A. S. U.
    Nugaliyadde, A.
    Thelijjagoda, S.
    [J]. 2015 IEEE 10TH INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2015, : 302 - 307
  • [8] A Malay Named Entity Recognition Using Conditional Random Fields
    Salleh, Muhammad Sharilazlan
    Asmai, Siti Azirah
    Basiron, Halizah
    Ahmad, Sabrina
    [J]. 2017 5TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOIC7), 2017,
  • [9] A CONDITIONAL RANDOM FIELDS APPROACH TO BIOMEDICAL NAMED ENTITY RECOGNITION
    Wang Haochang Zhao Tiejun Li Sheng Yu Hao (School of Computer Science and Technology
    [J]. Journal of Electronics(China), 2007, (06) : 838 - 844
  • [10] Kannada Named Entity Recognition and classification using Conditional Random Fields
    Amarappa, S.
    Sathyanarayana, S. V.
    [J]. 2015 INTERNATIONAL CONFERENCE ON EMERGING RESEARCH IN ELECTRONICS, COMPUTER SCIENCE AND TECHNOLOGY (ICERECT), 2015, : 186 - 191