Mining literature for protein-protein interactions

被引:174
|
作者
Marcotte, EM
Xenarios, I
Eisenberg, D
机构
[1] Univ Calif Los Angeles, Lab Struct Biol & Mol Med, DOE, Inst Mol Biol, Los Angeles, CA 90095 USA
[2] Prot Pathways Inc, Los Angeles, CA 90024 USA
[3] Univ Texas, Dept Chem & Biochem, Inst Cell & Mol Biol, Austin, TX 78712 USA
关键词
D O I
10.1093/bioinformatics/17.4.359
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: A central problem in bioinformatics is how to capture information from the vast current scientific literature in a form suitable for analysis by computer. We address the special case of information on protein-protein interactions, and show that the frequencies of words in Medline abstracts can be used to determine whether or not a given paper discusses protein-protein interactions. For those papers determined to discuss this topic, the relevant information can be captured for the Database of Interacting Proteins. Furthermore, suitable gene annotations can also be captured. Results: Our Bayesian approach scores Medline abstracts for probability of discussing the topic of interest according to the frequencies of discriminating words found in the abstract. More than 80 discriminating words (e.g, complex, interaction, two-hybrid) were determined from a training set of 260 Medline abstracts corresponding to previously validated entries in the Database of Interacting Proteins, Using these words and a log likelihood scoring function, similar to 2000 Medline abstracts were identified as describing interactions between yeast proteins. This approach now forms the basis for the rapid expansion of the Database of Interacting Proteins.
引用
收藏
页码:359 / 363
页数:5
相关论文
共 50 条
  • [1] Mining physical protein-protein interactions from the literature
    Huang, Minlie
    Ding, Shilin
    Wang, Hongning
    Zhu, Xiaoyan
    [J]. GENOME BIOLOGY, 2008, 9
  • [2] Mining physical protein-protein interactions from the literature
    Huang M.
    Ding S.
    Wang H.
    Zhu X.
    [J]. Genome Biology, 9 (Suppl 2):
  • [3] Mining Impact of Protein Modifications on Protein-Protein Interactions from Literature
    Siu, Amy
    Arighi, Cecilia
    Nchoutmboube, Jules
    Tudor, Catalina O.
    Vijay-Shanker, K.
    Wu, Cathy H.
    [J]. BIBMW: 2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOP, 2009, : 343 - 343
  • [4] Mining new protein-protein interactions
    Mamitsuka, H
    [J]. IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE, 2005, 24 (03): : 103 - 108
  • [5] Mining from protein-protein interactions
    Mamitsuka, Hiroshi
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 2 (05) : 400 - 410
  • [6] Data mining methods for protein-protein interactions
    Nafar, Zahra
    Golshani, Ashkan
    [J]. 2006 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-5, 2006, : 2090 - +
  • [7] Predicting protein-protein interactions by association mining
    Kotlyar, M
    Jurisica, I
    [J]. INFORMATION SYSTEMS FRONTIERS, 2006, 8 (01) : 37 - 46
  • [8] Predicting Protein-Protein Interactions by Association Mining
    [J]. Information Systems Frontiers, 2006, 8 : 37 - 47
  • [9] Biomedical literature mining for protein-protein interactions analysis using electronic mailing system
    Karthikeyan, Muthukumarasamy
    Vyas, Renu
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 253
  • [10] Computational Approach to Biological Validation of Protein-Protein Interactions Discovered using Literature Mining
    Antony, Anna
    Basetty, Srilaxmi
    Hartanto, Shielly
    Palakal, Mathew
    [J]. APPLIED COMPUTING 2008, VOLS 1-3, 2008, : 1302 - 1306