Biomedical named entity recognition using generalized expectation criteria

被引:5
|
作者
Yao, Lin [1 ,2 ]
Sun, Chengjie [3 ]
Wu, Yan [2 ,3 ]
Wang, Xiaolong [1 ]
Wang, Xuan [1 ]
机构
[1] Harbin Inst Technol, Shenzhen Grad Sch, Dept Comp Sci, Shenzhen, Peoples R China
[2] Harbin Inst Technol, Sch Software, Harbin 150006, Peoples R China
[3] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150006, Peoples R China
基金
中国国家自然科学基金; 高等学校博士学科点专项科研基金;
关键词
Conditional random field; General expectation; Latent Dirichlet allocation; Biomedical named entity recognition; Semi-supervised learning; LATENT; CLASSIFICATION; FEATURES;
D O I
10.1007/s13042-011-0022-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is difficult to apply machine learning to a domain which is short of labeled training data, such as biomedical named entity recognition (NER) which remains a challenging task because of its extraordinary complex nomenclature. In this paper, we proposed a semi-supervised method which can train condition random field (CRF) models using generalized expectation (GE) criteria to solve biomedical named entity recognition problem. In the proposed method, instead of "instance'' labeling, the "feature'' labeling is applied to get the training data which can save lots of labeling time. Latent Dirichlet Allocation (LDA) model was involved to choose the features for labeling. Experiment results show that the proposed method can dramatically improve the performance of biomedical NER through incorporating unlabeled data by feature labeling.
引用
收藏
页码:235 / 243
页数:9
相关论文
共 50 条
  • [1] Biomedical named entity recognition using generalized expectation criteria
    Lin Yao
    Chengjie Sun
    Yan Wu
    Xiaolong Wang
    Xuan Wang
    International Journal of Machine Learning and Cybernetics, 2011, 2 : 235 - 243
  • [2] Various criteria in the evaluation of biomedical named entity recognition
    Tsai, RTH
    Wu, SH
    Chou, WC
    Lin, YC
    He, D
    Hsiang, J
    Sung, TY
    Hsu, WL
    BMC BIOINFORMATICS, 2006, 7 (1) : 1 - 8
  • [3] Various criteria in the evaluation of biomedical named entity recognition
    Richard Tzong-Han Tsai
    Shih-Hung Wu
    Wen-Chi Chou
    Yu-Chun Lin
    Ding He
    Jieh Hsiang
    Ting-Yi Sung
    Wen-Lian Hsu
    BMC Bioinformatics, 7
  • [4] Named Entity Recognition for Vietnamese documents using semi-supervised learning method of CRFs with Generalized Expectation Criteria
    Thi-Ngan Pham
    Le Minh Nguyen
    Quang-Thuy Ha
    2012 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2012), 2012, : 85 - 88
  • [5] A review of biomedical named entity recognition
    Chang, Lu
    Zhang, Ruihuan
    Lv, Jia
    Zhou, Weiguang
    Bai, Yunli
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2022, 22 (03) : 893 - 900
  • [6] Biomedical named entity recognition system
    Patrick, J. (jonpat@it.usyd.edu.au), 2005, School of Information Technologies
  • [7] Lao Named Entity Recognition based on semi-supervised cascaded Conditional Random Fields with generalized expectation criteria
    Yang, Mengjie
    Zhou, Lanjiang
    Yu, Zhengtao
    Wang, Hongbin
    Journal of Computational Information Systems, 2015, 11 (20): : 7595 - 7606
  • [8] Biomedical Named Entity Recognition Based on MCBERT
    Wang, Sai
    Yilahun, Hankiz
    Hamdulla, Askar
    2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 247 - 252
  • [9] Named Entity Recognition for Tamil Biomedical Documents
    Antony, Betina J.
    Mahalakshmi, G. S.
    2014 IEEE INTERNATIONAL CONFERENCE ON CIRCUIT, POWER AND COMPUTING TECHNOLOGIES (ICCPCT-2014), 2014, : 1571 - 1577
  • [10] A Genetic Approach for Biomedical Named Entity Recognition
    Ekbal, Asif
    Saha, Sriparna
    Sikdar, Utpal Kumar
    Hasanuzzaman, Md
    22ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2010), PROCEEDINGS, VOL 2, 2010, : 354 - +