Distributed smoothed tree kernel for protein-protein interaction extraction from the biomedical literature

Cited by: 16
Authors
Murugesan, Gurusamy [1]
Abdulkadhar, Sabenabanu [1]
Natarajan, Jeyakumar [1]
Affiliation
[1] Bharathiar Univ, Dept Bioinformat, Data Min & Text Min Lab, Coimbatore, Tamil Nadu, India
Source
PLOS ONE | 2017 / Volume 12 / Issue 11
Keywords
PREDICTION;
DOI
10.1371/journal.pone.0187379
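Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code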
07 ; 0710 ; 09 ;
Abstract
Automatic extraction of protein-protein interaction (PPI) pairs from biomedical literature is a widely examined task in biological information extraction. Currently, many kernel-based approaches, such as linear kernels, tree kernels, graph kernels, and combinations of multiple kernels, have achieved promising results on the PPI task. However, most of these kernel methods fail to capture the semantic relation information between two entities. In this paper, we present a special type of tree kernel for PPI extraction, known as the Distributed Smoothed Tree Kernel (DSTK), which exploits both syntactic (structural) information and semantic vector information. DSTK comprises distributed trees carrying syntactic information, along with distributional semantic vectors representing the semantic information of the sentences or phrases. To build a robust machine learning model, a feature-based kernel and DSTK were combined using an ensemble support vector machine (SVM). Five different corpora (AIMed, BioInfer, HPRD50, IEPA, and LLL) were used to evaluate the performance of our system. Experimental results show that our system achieves a better F-score on these five corpora than other state-of-the-art systems.
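The abstract states that a feature-based kernel and DSTK were combined, but this record does not say how. As a minimal sketch only, one common way to combine two kernels is a weighted sum of their Gram matrices, which is itself a valid kernel when the weights are non-negative. Everything below is an assumption for illustration: the names `gram_matrix` and `combine_kernels`, the `weight` parameter, and the use of a plain dot product over placeholder vectors as a stand-in for the DSTK. It is not the authors' implementation.

```python
def dot(x, y):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(x, y))


def gram_matrix(vectors):
    """Pairwise dot products: entry [i][j] is the kernel value K(x_i, x_j)."""
    return [[dot(u, v) for v in vectors] for u in vectors]


def combine_kernels(k_feature, k_tree, weight=0.5):
    """Convex combination of two Gram matrices of the same size.

    `weight` (hypothetical parameter) is the share given to the
    feature-based kernel; the remainder goes to the tree kernel.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    n = len(k_feature)
    return [
        [weight * k_feature[i][j] + (1.0 - weight) * k_tree[i][j]
         for j in range(n)]
        for i in range(n)
    ]


# Toy data: two candidate protein pairs, each described twice.
feature_vectors = [[1, 0], [0, 1]]   # hand-crafted features (placeholder)
tree_vectors = [[1, 2], [2, 1]]      # stand-in for distributed tree vectors

combined = combine_kernels(gram_matrix(feature_vectors),
                           gram_matrix(tree_vectors))
```

A combined matrix of this kind could then be passed to any SVM implementation that accepts a precomputed kernel; the choice of weight would normally be tuned on held-out data.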
Pages: 14
Related papers
50 in total
  • [1] Tree kernel-based protein-protein interaction extraction from biomedical literature
    Qian, Longhua
    Zhou, Guodong
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2012, 45 (03) : 535 - 543
  • [2] Multiple kernel learning in protein-protein interaction extraction from biomedical literature
    Yang, Zhihao
    Tang, Nan
    Zhang, Xiao
    Lin, Hongfei
    Li, Yanpeng
    Yang, Zhiwei
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2011, 51 (03) : 163 - 173
  • [3] A tree kernel-based method for protein-protein interaction mining from biomedical literature
    Eom, Jae-Hong
    Kim, Sun
    Kim, Seong-Hwan
    Zhang, Byoung-Tak
    [J]. KNOWLEDGE DISCOVERY IN LIFE SCIENCE LITERATURE, PROCEEDINGS, 2006, 3886 : 42 - 52
  • [4] Protein-protein interaction extraction from biomedical literatures based on a combined kernel
    Li, Lishuang
    Ping, Jinyu
    Huang, Degen
    [J]. JOURNAL OF INFORMATION AND COMPUTATIONAL SCIENCE, 2010, 7 (05) : 1065 - 1073
  • [5] BioPPIExtractor: A protein-protein interaction extraction system for biomedical literature
    Yang, Zhihao
    Lin, Hongfei
    Wu, Baodong
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 2228 - 2233
  • [6] A Hybrid Protein-Protein Interaction Triple Extraction Method for Biomedical Literature
    Zhao, Zhehuan
    Yang, Zhihao
    Sun, Cong
    Wang, Lei
    Lin, Hongfei
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 1515 - 1521
  • [7] Deep Neural Network Based Protein-Protein Interaction Extraction from Biomedical Literature
    Zhao, Zhehuan
    Yang, Zhihao
    Luo, Ling
    Lin, Hongfei
    Wang, Jian
    Gao, Song
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 1156 - 1156
  • [8] Extracting and mining protein-protein interaction network from biomedical literature
    Hu, XH
    Yoo, IH
    Song, IY
    Song, M
    Han, JC
    Lechner, M
    [J]. PROCEEDINGS OF THE 2004 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2004, : 244 - 251
  • [9] Ranking SVM for Multiple Kernels Output Combination in Protein-Protein Interaction Extraction from Biomedical Literature
    Yang, Zhihao
    Lin, Yuan
    Wu, Jiajin
    Tang, Nan
    Lin, Hongfei
    Li, Yanpeng
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2010, : 595 - 598
  • [10] Uncertainty sampling-based active learning for protein-protein interaction extraction from biomedical literature
    Cui, Baojin
    Lin, Hongfei
    Yang, Zhihao
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (07) : 10344 - 10350