BioC-compatible full-text passage detection for protein-protein interactions using extended dependency graph

被引:2
|
作者
Peng, Yifan [1 ]
Arighi, Cecilia [1 ,2 ]
Wu, Cathy H. [1 ,2 ]
Vijay-Shanker, K. [1 ]
机构
[1] Univ Delaware, Comp & Informat Sci, Newark, DE 19716 USA
[2] Univ Delaware, Ctr Bioinformat & Computat Biol, Newark, DE 19716 USA
基金
美国国家科学基金会;
关键词
D O I
10.1093/database/baw072
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
There has been a large growth in the number of biomedical publications that report experimental results. Many of these results concern detection of protein-protein interactions (PPI). In BioCreative V, we participated in the BioC task and developed a PPI system to detect text passages with PPIs in the full-text articles. By adopting the BioC format, the output of the system can be seamlessly added to the biocuration pipeline with little effort required for the system integration. A distinctive feature of our PPI system is that it utilizes extended dependency graph, an intermediate level of representation that attempts to abstract away syntactic variations in text. As a result, we are able to use only a limited set of rules to extract PPI pairs in the sentences, and additional rules to detect additional passages for PPI pairs. For evaluation, we used the 95 articles that were provided for the BioC annotation task. We retrieved the unique PPIs from the BioGRID database for these articles and show that our system achieves a recall of 83.5%. In order to evaluate the detection of passages with PPIs, we further annotated Abstract and Results sections of 20 documents from the dataset and show that an f-value of 80.5% was obtained. To evaluate the generalizability of the system, we also conducted experiments on AIMed, a well-known PPI corpus. We achieved an f-value of 76.1% for sentence detection and an f-value of 64.7% for unique PPI detection.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Efficient Extraction of Protein-Protein Interactions from Full-Text Articles
    Hakenberg, Joerg
    Leaman, Robert
    Vo, Nguyen Ha
    Jonnalagadda, Siddhartha
    Sullivan, Ryan
    Miller, Christopher
    Tari, Luis
    Baral, Chitta
    Gonzalez, Graciela
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2010, 7 (03) : 481 - 494
  • [2] The BioC-BioGRID corpus: full text articles annotated for curation of protein-protein and genetic interactions
    Dogan, Rezarta Islamaj
    Kim, Sun
    Chatr-aryamontri, Andrew
    Chang, Christie S.
    Oughtred, Rose
    Rust, Jennifer
    Wilbur, W. John
    Comeau, Donald C.
    Dolinski, Kara
    Tyers, Mike
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2017,
  • [3] Classification of Protein-Protein Interaction Full-Text Documents Using Text and Citation Network Features
    Kolchinsky, Artemy
    Abi-Haidar, Alaa
    Kaur, Jasleen
    Hamed, Ahmed Abdeen
    Rocha, Luis M.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2010, 7 (03) : 400 - 411
  • [4] Predicting Protein-Protein Interactions Using Full Bayesian Network
    Li, Hui
    Liu, Chunmei
    Burge, Legand
    Ko, Kyung Dae
    Southerland, William
    2012 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2012,
  • [5] Exploring useful features and kernel combinations from dependency graph for protein-protein interactions extraction
    Wang, Jian
    Ji, Minghui
    Lin, Hongfei
    Journal of Computational Information Systems, 2012, 8 (03): : 1221 - 1228
  • [6] Automatic extraction of protein-protein interactions using grammatical relationship graph
    Yu, Kaixian
    Lung, Pei-Yau
    Zhao, Tingting
    Zhao, Peixiang
    Tseng, Yan-Yuan
    Zhang, Jinfeng
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2018, 18
  • [7] Predicting protein-protein interactions using graph invariants and a neural network
    Knisley, D.
    Knisley, J.
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2011, 35 (02) : 108 - 113
  • [8] Automatic extraction of protein-protein interactions using grammatical relationship graph
    Kaixian Yu
    Pei-Yau Lung
    Tingting Zhao
    Peixiang Zhao
    Yan-Yuan Tseng
    Jinfeng Zhang
    BMC Medical Informatics and Decision Making, 18
  • [9] Predicting Missing and Spurious Protein-Protein Interactions Using Graph Embeddings on GO Annotation Graph
    Zhong, Xiaoshi
    Rajapakse, Jagath C.
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 1828 - 1835
  • [10] Detection of protein-protein interactions in plants using bimolecular fluorescence complementation
    Bracha-Drori, K
    Shichrur, K
    Katz, A
    Oliva, M
    Angelovici, R
    Yalovsky, S
    Ohad, N
    PLANT JOURNAL, 2004, 40 (03): : 419 - 427