Two learning approaches for protein name extraction

被引:6
|
作者
Tatar, Serhan [1 ]
Cicekli, Ilyas [1 ]
机构
[1] Bilkent Univ, Dept Comp Engn, TR-06800 Ankara, Turkey
关键词
Statistical learning; Bigram language model; Rule learning; Protein name extraction; Information extraction; GENE; IDENTIFICATION; PERFORMANCE; BLAST;
D O I
10.1016/j.jbi.2009.05.004
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Protein name extraction, one of the basic tasks in automatic extraction of information from biological texts, remains challenging. In this paper, we explore the use of two different machine learning techniques and present the results of the conducted experiments. in the first method, Bigram language model is used to extract protein names. In the latter, we use an automatic rule learning method that can identify protein names located in the biological texts. In both cases, we generalize protein names by using hierarchically categorized syntactic token types. We conducted our experiments on two different datasets. our first method based on Bigram language model achieved an F-score of 67.7% on the YAPEX dataset and 66.8% on the GENIA corpus. The developed rule learning method obtained 61.8% F-score value on the YAPEX dataset and 61.0% on the GENIA corpus. The results of the comparative experiments demonstrate that both techniques are applicable to the task of automatic protein name extraction, a prerequisite for the large-scale processing of biomedical literature. (C) 2009 Elsevier Inc. All rights reserved.
引用
收藏
页码:1046 / 1055
页数:10
相关论文
共 50 条
  • [31] An Active Transfer Learning Framework for Protein-Protein Interaction Extraction
    Li, Lishuang
    He, Xinyu
    Zheng, Jieqiong
    Huang, Degen
    Ren, Fuji
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (02): : 504 - 511
  • [32] Protein-Protein Interaction Extraction Based on Actively Transfer Learning
    Li, Lishuang
    Zheng, Jieqiong
    Song, Dingxin
    Guo, Rui
    Huang, Degen
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 1717 - 1717
  • [33] Protein backbone angle prediction with machine learning approaches
    Kuang, R
    Leslie, CS
    Yang, AS
    [J]. BIOINFORMATICS, 2004, 20 (10) : 1612 - 1621
  • [34] Learning to forecast the exchange rate: Two competing approaches
    De Grauwe, Paul
    Markiewicz, Agnieszka
    [J]. JOURNAL OF INTERNATIONAL MONEY AND FINANCE, 2013, 32 : 42 - 76
  • [35] Machine Learning Approaches for Predicting Protein Complex Similarity
    Farhoodi, Roshanak
    Akbal-Delibas, Bahar
    Haspel, Nurit
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2017, 24 (01) : 40 - 51
  • [36] Two Approaches, One Course: An Experience in Experiential Learning
    Lien, Ashlee D.
    Hakim, Sharon M.
    [J]. JOURNAL OF PREVENTION & INTERVENTION IN THE COMMUNITY, 2013, 41 (02) : 128 - 135
  • [37] Deep Learning Approaches for the Protein Scaffold Filling Problem
    Sturtz, Jordan
    Zhu, Binhai
    Liu, Xiaowen
    Fu, Xingang
    Yuan, Xiaohong
    Qingge, Letu
    [J]. 2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 1055 - 1061
  • [38] Machine Learning Approaches for Quality Assessment of Protein Structures
    Chen, Jiarui
    Siu, Shirley W., I
    [J]. BIOMOLECULES, 2020, 10 (04)
  • [39] Novel machine learning approaches revolutionize protein knowledge
    Bordin, Nicola
    Dallago, Christian
    Heinzinger, Michael
    Kim, Stephanie
    Littmann, Maria
    Rauer, Clemens
    Steinegger, Martin
    Rost, Burkhard
    Orengo, Christine
    [J]. TRENDS IN BIOCHEMICAL SCIENCES, 2023, 48 (04) : 345 - 359
  • [40] Learning outcomes of two approaches to multicultural music education
    Abril, Carlos R.
    [J]. INTERNATIONAL JOURNAL OF MUSIC EDUCATION, 2006, 24 (01) : 30 - 42