Two learning approaches for protein name extraction

被引:6
|
作者
Tatar, Serhan [1 ]
Cicekli, Ilyas [1 ]
机构
[1] Bilkent Univ, Dept Comp Engn, TR-06800 Ankara, Turkey
关键词
Statistical learning; Bigram language model; Rule learning; Protein name extraction; Information extraction; GENE; IDENTIFICATION; PERFORMANCE; BLAST;
D O I
10.1016/j.jbi.2009.05.004
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Protein name extraction, one of the basic tasks in automatic extraction of information from biological texts, remains challenging. In this paper, we explore the use of two different machine learning techniques and present the results of the conducted experiments. in the first method, Bigram language model is used to extract protein names. In the latter, we use an automatic rule learning method that can identify protein names located in the biological texts. In both cases, we generalize protein names by using hierarchically categorized syntactic token types. We conducted our experiments on two different datasets. our first method based on Bigram language model achieved an F-score of 67.7% on the YAPEX dataset and 66.8% on the GENIA corpus. The developed rule learning method obtained 61.8% F-score value on the YAPEX dataset and 61.0% on the GENIA corpus. The results of the comparative experiments demonstrate that both techniques are applicable to the task of automatic protein name extraction, a prerequisite for the large-scale processing of biomedical literature. (C) 2009 Elsevier Inc. All rights reserved.
引用
收藏
页码:1046 / 1055
页数:10
相关论文
共 50 条
  • [21] SOPHISTRY AND PHILOSOPHY: TWO APPROACHES TO TEACHING LEARNING
    Sturm, Sean
    [J]. EDUCATION IN THE KNOWLEDGE SOCIETY, 2013, 14 (03): : 25 - 36
  • [22] Two Clause Learning Approaches for Disjunctive Scheduling
    Siala, Mohamed
    Artigues, Christian
    Hebrard, Emmanuel
    [J]. PRINCIPLES AND PRACTICE OF CONSTRAINT PROGRAMMING, CP 2015, 2015, 9255 : 393 - 402
  • [23] Comparison of the efficacy of two name-learning techniques: Expanding rehearsal and name-face imagery
    Neuschatz, J
    Preston, EL
    Toglia, MP
    Neuschatz, JS
    [J]. AMERICAN JOURNAL OF PSYCHOLOGY, 2005, 118 (01): : 79 - 101
  • [24] Human Joint Profile Extraction using Deep Learning Approaches
    Weisscohen, Miri
    Vitali, Andrea
    Regazzoni, Daniele
    [J]. Computer-Aided Design and Applications, 2023, 20 (04): : 704 - 715
  • [25] In the name of protein
    Julie Guthman
    Michaelanne Butler
    Sarah J. Martin
    Charles Mather
    Charlotte Biltekoff
    [J]. Nature Food, 2022, 3 (6): : 391 - 393
  • [26] In the name of protein
    Guthman, Julie
    Butler, Michaelanne
    Martin, Sarah J.
    Mather, Charles
    Biltekoff, Charlotte
    [J]. NATURE FOOD, 2022, 3 (06): : 391 - 393
  • [27] Two Different Approaches of Feature Extraction for Classifying the EEG Signals
    Jahankhani, Pari
    Lara, Juan A.
    Perez, Aurora
    Valente, Juan P.
    [J]. ENGINEERING APPLICATIONS OF NEURAL NETWORKS, PT I, 2011, 363 : 229 - +
  • [28] One stage versus two stages deep learning approaches for the extraction of drug-drug interactions from texts
    Miranda-Escalada, Antonio
    Segura-Bedmar, Isabel
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2020, (64): : 69 - 76
  • [29] Ontology Driven Machine learning Approach for Disease Name Extraction from Twitter Messages
    Magumba, Mark Abraham
    Nabende, Peter
    Mwebaze, Earnest
    [J]. 2017 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND APPLICATIONS (ICCIA), 2017, : 68 - 73
  • [30] Application of Machine Learning Approaches for Protein-protein Interactions Prediction
    Zhang, Mengying
    Su, Qiang
    Lu, Yi
    Zhao, Manman
    Niu, Bing
    [J]. MEDICINAL CHEMISTRY, 2017, 13 (06) : 506 - 514