A text feature-based approach for literature mining of lncRNA-protein interactions

Cited by: 15
Authors
Li, Ao [1 ,2 ]
Zang, Qiguang [1 ]
Sun, Dongdong [1 ]
Wang, Minghui [1 ,2 ]
Affiliations
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, 443 Huangshan Rd, Hefei 230027, Peoples R China
[2] Univ Sci & Technol China, Ctr Biomed Engn, 443 Huangshan Rd, Hefei 230027, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
LncRNA-protein interaction; Text mining; Text features; Machine learning; NONCODING RNAS; DATABASE;
DOI
10.1016/j.neucom.2015.11.110
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Long non-coding RNAs (lncRNAs) play important roles in regulation at the transcriptional and post-transcriptional levels. Knowledge of lncRNA-protein interactions (LPIs) is crucial for lncRNA-related biomedical research. Many newly discovered LPIs are reported in the biomedical literature. With over one million new biomedical journal articles published every year, keeping up with novel findings requires automatic information extraction by text mining. To address this issue, we apply a text feature-based text mining approach to efficiently extract LPIs from the biomedical literature. By employing natural language processing (NLP) technologies, this approach extracts text features from sentences that can precisely reflect real LPIs. Our approach involves four steps, including data collection, text pre-processing, structured representation, feature extraction, and model training and classification. The approach achieves an F-score of 79.5%, and the results indicate that it can efficiently extract LPIs from biomedical literature. (C) 2016 Elsevier B.V. All rights reserved.
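To make the pipeline in the abstract concrete, here is a minimal sketch of sentence-level feature extraction, classification, and F-score evaluation. Everything in it is an assumption for illustration. The abstract does not list the features or the model, so the trigger-word set, the three features, the rule-based stand-in for a trained classifier, and the placeholder entity names (LNC-A, PROT-B) are hypothetical and are not the authors' method or data.

```python
import re

# Hypothetical trigger and negation words; the abstract does not give the real feature set.
INTERACTION_WORDS = {"bind", "binds", "interact", "interacts", "recruits"}
NEGATION_WORDS = {"not", "no", "never"}


def tokenize(sentence):
    """Lowercase the sentence and split it into alphanumeric/hyphenated tokens."""
    return re.findall(r"[A-Za-z0-9-]+", sentence.lower())


def extract_features(sentence, lncrna, protein):
    """Return simple text features for one lncRNA/protein pair, or None if either name is absent."""
    tokens = tokenize(sentence)
    l_name, p_name = lncrna.lower(), protein.lower()
    if l_name not in tokens or p_name not in tokens:
        return None
    i, j = tokens.index(l_name), tokens.index(p_name)
    lo, hi = min(i, j), max(i, j)
    between = tokens[lo + 1:hi]
    return {
        "distance": hi - lo,  # token distance between the two entity mentions
        "trigger_between": any(t in INTERACTION_WORDS for t in between),
        "negated": any(t in NEGATION_WORDS for t in tokens),
    }


def rule_classifier(features):
    """Stand-in for a trained model: predict an interaction from the toy features."""
    return bool(features and features["trigger_between"] and not features["negated"])


def f_score(gold, predicted):
    """F1 score: harmonic mean of precision and recall over boolean labels."""
    tp = sum(1 for g, p in zip(gold, predicted) if g and p)
    fp = sum(1 for g, p in zip(gold, predicted) if not g and p)
    fn = sum(1 for g, p in zip(gold, predicted) if g and not p)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Made-up sentences with placeholder entity names, paired with gold labels.
examples = [
    ("LNC-A binds PROT-B in vitro.", True),
    ("LNC-A does not interact with PROT-B.", False),
    ("LNC-A and PROT-B were both measured.", False),
    ("PROT-B is recruited by LNC-A.", True),  # missed: "recruited" is not a trigger word
]
gold = [label for _, label in examples]
predicted = [rule_classifier(extract_features(s, "LNC-A", "PROT-B")) for s, _ in examples]
score = f_score(gold, predicted)
print(f"F-score on toy data: {score:.3f}")
```

In a real system the rule-based stand-in would be replaced by a classifier trained on annotated sentences, and the feature dictionary would be built from the structured representation the abstract mentions.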
Pages: 73-80
Page count: 8