RECOGNITION OF GENE/PROTEIN NAMES USING CONDITIONAL RANDOM FIELDS

被引:0
|
作者
Campos, David [1 ]
Matos, Sergio [1 ]
Oliveira, Jose Luis [1 ]
机构
[1] Univ Aveiro, Inst Elect & Telemat Engn Aveiro, Campus Univ Santiago, P-3810193 Aveiro, Portugal
关键词
Natural Language Processing; Text Mining; Machine Learning; Named Entity Recognition; Gene/Protein Names; PROTEIN;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the overwhelming amount of publicly available data in the biomedical field, traditional tasks performed by expert database annotators rapidly became hard and very expensive. This situation led to the development of computerized systems to extract information in a structured manner. The first step of such systems requires the identification of named entities (e.g. gene/protein names), a task called Named Entity Recognition (NER). Much of the current research to tackle this problem is based on Machine Learning (ML) techniques, which demand careful and sensitive definition of the several used methods. This article presents a NER system using Conditional Random Fields (CRFs) as the machine learning technique, combining the best techniques recently described in the literature. The proposed system uses biomedical knowledge and a large set of orthographic and morphological features. An F-measure of 0,7936 was obtained on the BioCreative II Gene Mention corpus, achieving a significantly better performance than similar baseline systems.
引用
收藏
页码:275 / 280
页数:6
相关论文
共 50 条
  • [41] Section heading recognition in electronic health records using conditional random fields
    Chen, Chih-Wei
    Chang, Nai-Wen
    Chang, Yung-Chun
    Dai, Hong-Jie
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8916 : 47 - 55
  • [42] Conditional Random Fields for Spanish Named Entity Recognition Using Unsupervised Features
    Copara, Jenny
    Ochoa, Jose
    Thorne, Camilo
    Glavas, Goran
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2016, 2016, 10022 : 175 - 186
  • [43] Human Action Recognition Using Manifold Learning and Hidden Conditional Random Fields
    Liu, Fawang
    Jia, Yunde
    [J]. PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE FOR YOUNG COMPUTER SCIENTISTS, VOLS 1-5, 2008, : 693 - 698
  • [44] Disease Named Entity Recognition Using Semisupervised Learning and Conditional Random Fields
    Suakkaphong, Nichalin
    Zhang, Zhu
    Chen, Hsinchun
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2011, 62 (04): : 727 - 737
  • [45] Automatic Social Role Recognition In Professional Meetings Using Conditional Random Fields
    Sapru, Ashtosh
    Bourlard, Herve
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1529 - 1533
  • [46] Pedestrian Intention Recognition using Latent-dynamic Conditional Random Fields
    Schulz, Andreas Th.
    Stiefelhagen, Rainer
    [J]. 2015 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2015, : 622 - 627
  • [47] Named Entity Recognition in Bengali and Hindi Using MuRIL and Conditional Random Fields
    Kaushik Bose
    Kamal Sarkar
    [J]. SN Computer Science, 5 (7)
  • [48] Diagram structure recognition by Bayesian conditional random fields
    Qi, Y
    Szummer, M
    Minka, TP
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, : 191 - 196
  • [49] Recognition of Internet word based on conditional random fields
    [J]. Hu, Y. (huyong@scu.edu.cn), 1600, Binary Information Press, Flat F 8th Floor, Block 3, Tanner Garden, 18 Tanner Road, Hong Kong (11):
  • [50] Hidden Conditional Random Fields for Visual Speech Recognition
    Pass, Adrian
    Zhang, Jianguo
    Stewart, Darryl
    [J]. 2009 13TH INTERNATIONAL MACHINE VISION AND IMAGE PROCESSING CONFERENCE, 2009, : 117 - 122