Mining literature for protein-protein interactions

被引:174
|
作者
Marcotte, EM
Xenarios, I
Eisenberg, D
机构
[1] Univ Calif Los Angeles, Lab Struct Biol & Mol Med, DOE, Inst Mol Biol, Los Angeles, CA 90095 USA
[2] Prot Pathways Inc, Los Angeles, CA 90024 USA
[3] Univ Texas, Dept Chem & Biochem, Inst Cell & Mol Biol, Austin, TX 78712 USA
关键词
D O I
10.1093/bioinformatics/17.4.359
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: A central problem in bioinformatics is how to capture information from the vast current scientific literature in a form suitable for analysis by computer. We address the special case of information on protein-protein interactions, and show that the frequencies of words in Medline abstracts can be used to determine whether or not a given paper discusses protein-protein interactions. For those papers determined to discuss this topic, the relevant information can be captured for the Database of Interacting Proteins. Furthermore, suitable gene annotations can also be captured. Results: Our Bayesian approach scores Medline abstracts for probability of discussing the topic of interest according to the frequencies of discriminating words found in the abstract. More than 80 discriminating words (e.g, complex, interaction, two-hybrid) were determined from a training set of 260 Medline abstracts corresponding to previously validated entries in the Database of Interacting Proteins, Using these words and a log likelihood scoring function, similar to 2000 Medline abstracts were identified as describing interactions between yeast proteins. This approach now forms the basis for the rapid expansion of the Database of Interacting Proteins.
引用
收藏
页码:359 / 363
页数:5
相关论文
共 50 条
  • [41] Protein Function Assignment through Mining Cross-Species Protein-Protein Interactions
    Chen, Xue-wen
    Liu, Mei
    Ward, Robert
    [J]. PLOS ONE, 2008, 3 (02):
  • [42] Automated extraction of information on protein-protein interactions from the biological literature
    Ono, T
    Hishigaki, H
    Tanigami, A
    Takagi, T
    [J]. BIOINFORMATICS, 2001, 17 (02) : 155 - 161
  • [43] Assisting manual literature curation for protein-protein interactions using BioQRator
    Kwon, Dongseop
    Kim, Sun
    Shin, Soo-Yong
    Chatr-aryamontri, Andrew
    Wilbur, W. John
    [J]. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2014,
  • [44] Construct Protein-Protein Interaction Network by Mining Domain-Domain Interactions
    Zhixia Teng
    Maozu Guo
    Xiaoyan Liu
    Jin Li
    Qiguo Dai
    Chunyu Wang
    [J]. Journal of Harbin Institute of Technology(New series), 2016, (04) : 27 - 36
  • [45] Construct Protein-Protein Interaction Network by Mining Domain-Domain Interactions
    Zhixia Teng
    Maozu Guo
    Xiaoyan Liu
    Jin Li
    Qiguo Dai
    Chunyu Wang
    [J]. Journal of Harbin Institute of Technology., 2016, 23 (04) - 36
  • [46] Evidence mining and novelty assessment of protein-protein interactions with the ConsensusPathDB plugin for Cytoscape
    Pentchev, Konstantin
    Ono, Keiichiro
    Herwig, Ralf
    Ideker, Trey
    Kamburov, Atanas
    [J]. BIOINFORMATICS, 2010, 26 (21) : 2796 - 2797
  • [47] Aquaporin Protein-Protein Interactions
    Roche, Jennifer Virginia
    Tornroth-Horsefield, Susanna
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2017, 18 (11)
  • [48] Efficient mining from heterogeneous data sets for predicting protein-protein interactions
    Mamitsuka, H
    [J]. 14TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2003, : 32 - 36
  • [49] Contextualized Protein-Protein Interactions
    Federico, Anthony
    Monti, Stefano
    [J]. PATTERNS, 2021, 2 (01):
  • [50] Measuring protein-protein interactions
    Lakey, JH
    Raggett, EM
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 1998, 8 (01) : 119 - 123