A general protein-protein interaction extraction architecture based on word representation and feature selection

被引:9
|
作者
Jiang, Zhenchao [1 ]
Li, Lishuang [1 ]
Huang, Degen [1 ]
机构
[1] Dalian Univ Technol, Sch Comp Sci & Technol, 2 Linggong Rd, Dalian 116024, Peoples R China
基金
中国国家自然科学基金;
关键词
instance representation; word representation; protein-protein interaction; relation extraction; biomedical text mining; INFORMATION EXTRACTION;
D O I
10.1504/IJDMB.2016.074878
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Previous researches have shown that supervised Protein-Protein Interaction Extraction (PPIE) can get high accuracies with elaborately selected features and kernels. However, most features and kernels rest upon domain knowledge and natural language analysis, which makes the supervised model expensive, heavy and brittle. Moreover, commonly used representation techniques, such as one-hot encoding and Vector Space Model, fail to capture the semantic similarity between words. To reduce the manual labour and take advantage of semantic representation, we put forward a general instance representation architecture for PPIE, which integrates word representation, vector composition and feature selection. Our method obtains F-scores of 69.7, 78.8, 72.3, 72.0 and 83.7 on AIMed, BioInfer, HPRD50, IEPA and LLL respectively.
引用
收藏
页码:276 / 291
页数:16
相关论文
共 50 条
  • [1] A General Instance Representation Architecture for Protein-Protein Interaction Extraction
    Li, Lishuang
    Jiang, Zhenchao
    Huang, Degen
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2014,
  • [2] Improving Kernel-Based Protein-Protein Interaction Extraction by Unsupervised Word Representation
    Li, Lishuang
    Guo, Rui
    Jiang, Zhenchao
    Huang, Degen
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2014,
  • [3] Exploiting Dependency Information for Feature-based Protein-Protein Interaction Extraction
    Liu, Bing
    Qian, Longhua
    Zhou, Guodong
    Zhu, Qiaoming
    [J]. 2010 INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT (CCCM2010), VOL II, 2010, : 99 - 102
  • [4] Exploiting dependency information for feature-based protein-protein interaction extraction
    Liu, Bing
    Qian, Longhua
    Zhou, Guodong
    Zhu, Qiaoming
    [J]. Advances in Intelligent and Soft Computing, 2011, 111 : 267 - 272
  • [5] Exploiting Dependency Information for Feature-Based Protein-Protein Interaction Extraction
    Liu, Bing
    Qian, Longhua
    Zhou, Guodong
    Zhu, Qiaoming
    [J]. PROCEEDINGS OF THE 2011 INTERNATIONAL CONFERENCE ON INFORMATICS, CYBERNETICS, AND COMPUTER ENGINEERING (ICCE2011), VOL 2: INFORMATION SYSTEMS AND COMPUTER ENGINEERING, 2011, 111 : 267 - 272
  • [6] Applying Feature Coupling Generalization for Protein-Protein Interaction Extraction
    Li, Yanpeng
    Lin, Hongfei
    Yang, Zhihao
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2009, : 396 - 400
  • [7] Improving neural protein-protein interaction extraction with knowledge selection
    Zhou, Huiwei
    Li, Xuefei
    Yao, Weihong
    Liu, Zhuang
    Ning, Shixian
    Lang, Chengkun
    Du, Lei
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2019, 83
  • [8] Protein-protein interaction extraction with feature selection by evaluating contribution levels of groups consisting of related features
    Thi Thanh Thuy Phan
    Takenao Ohkawa
    [J]. BMC Bioinformatics, 17
  • [9] Protein-protein interaction extraction with feature selection by evaluating contribution levels of groups consisting of related features
    Thi Thanh Thuy Phan
    Ohkawa, Takenao
    [J]. BMC BIOINFORMATICS, 2016, 17
  • [10] A novel feature extraction scheme for prediction of protein-protein interaction sites
    Du, Xiuquan
    Jing, Anqi
    Hu, Xinying
    [J]. MOLECULAR BIOSYSTEMS, 2015, 11 (02) : 475 - 485