Biomedical named entity recognition using generalized expectation criteria

Cited by: 5
Authors
Yao, Lin [1, 2]
Sun, Chengjie [3]
Wu, Yan [2, 3]
Wang, Xiaolong [1]
Wang, Xuan [1]
Affiliations
[1] Harbin Inst Technol, Shenzhen Grad Sch, Dept Comp Sci, Shenzhen, Peoples R China
[2] Harbin Inst Technol, Sch Software, Harbin 150006, Peoples R China
[3] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150006, Peoples R China
Funding
National Natural Science Foundation of China; Specialized Research Fund for the Doctoral Program of Higher Education;
Keywords
Conditional random field; General expectation; Latent Dirichlet allocation; Biomedical named entity recognition; Semi-supervised learning; LATENT; CLASSIFICATION; FEATURES;
DOI
10.1007/s13042-011-0022-3
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104; 0812; 0835; 1405;
Abstract
Machine learning is difficult to apply in domains that lack labeled training data. Biomedical named entity recognition (NER) is one such domain, and it remains challenging because of its extraordinarily complex nomenclature. In this paper, we propose a semi-supervised method that trains conditional random field (CRF) models using generalized expectation (GE) criteria to address the biomedical NER problem. Instead of labeling instances, the method labels features to obtain training data, which saves considerable labeling time. A Latent Dirichlet Allocation (LDA) model is used to choose the features to label. Experimental results show that the proposed method can dramatically improve biomedical NER performance by incorporating unlabeled data through feature labeling.
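The feature-labeling idea in the abstract can be illustrated with a small sketch. It assumes the common form of a GE term: for one labeled feature, the score is the negative KL divergence between an annotator-supplied target label distribution and the model's expected label distribution over unlabeled tokens where that feature fires. The function names, the two-label set, and the example numbers are hypothetical and are not taken from the paper; a real CRF would supply the per-token marginals.

```python
import math


def expected_label_dist(token_probs, has_feature):
    """Average the model's per-token label distributions over the
    tokens on which the labeled feature fires."""
    selected = [p for p, fires in zip(token_probs, has_feature) if fires]
    if not selected:
        raise ValueError("feature never fires in the unlabeled data")
    n_labels = len(selected[0])
    return [sum(p[k] for p in selected) / len(selected) for k in range(n_labels)]


def kl_divergence(target, model, eps=1e-12):
    """KL(target || model); terms with zero target mass contribute nothing."""
    return sum(t * math.log(t / max(m, eps)) for t, m in zip(target, model) if t > 0)


def ge_score(target, token_probs, has_feature):
    """GE-style score for one labeled feature (assumed form): 0 when the
    model's expectation matches the target, negative otherwise."""
    return -kl_divergence(target, expected_label_dist(token_probs, has_feature))


# Hypothetical example: labels are (PROTEIN, O); the labeled feature is
# "token ends in -ase", which the annotator says is PROTEIN 90% of the time.
target = [0.9, 0.1]
token_probs = [[0.5, 0.5], [0.2, 0.8], [0.7, 0.3]]  # stand-in model marginals
has_feature = [True, False, True]
score = ge_score(target, token_probs, has_feature)
```

In training, a term like this would be summed over all labeled features and added to the objective, so that the model's predictions on unlabeled text are pulled toward the labeled feature expectations without any token-level annotation.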
Pages: 235-243
Page count: 9