A text feature-based approach for literature mining of lncRNA-protein interactions

被引:15
|
作者
Li, Ao [1 ,2 ]
Zang, Qiguang [1 ]
Sun, Dongdong [1 ]
Wang, Minghui [1 ,2 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, 443 Huangshan Rd, Hefei 230027, Peoples R China
[2] Univ Sci & Technol China, Ctr Biomed Engn, 443 Huangshan Rd, Hefei 230027, Peoples R China
基金
中国国家自然科学基金;
关键词
LncRNA-protein interaction; Text mining; Text features; Machine learning; NONCODING RNAS; DATABASE;
D O I
10.1016/j.neucom.2015.11.110
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Long non-coding RNAs (lncRNAs) play important roles in regulating transcriptional and post transcriptional levels. Currently, Knowledge of lncRNA and protein interactions (LPIs) is crucial for biomedical researches that are related to lncRNA. Many freshly discovered LPIs are stored in biomedical literature. With over one million new biomedical journal articles published every year, just keeping up with the novel finding requires automatically extracting information by text mining. To address this issue, we apply a text feature-based text mining approach to efficiently extract LPIs from biomedical literatures. Our approach consists of four steps. By employ natural language processing (NLP) technologies, this approach extracts text features from sentences that can precisely reflect the real LPIs. Our approach involves four steps including data collection, text pre-processing, structured representation, features extraction and training model and classification. The F-score performance of our approach achieves 79.5%, and the results indicate that the proposed approach can efficiently extract LPIs from biomedical literature. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:73 / 80
页数:8
相关论文
共 50 条
  • [31] HLPI-Ensemble: Prediction of human lncRNA-protein interactions based on ensemble strategy
    Hu, Huan
    Zhang, Li
    Ai, Haixin
    Zhang, Hui
    Fan, Yetian
    Zhao, Qi
    Liu, Hongsheng
    RNA BIOLOGY, 2018, 15 (06) : 797 - 806
  • [32] Ontology-Guided Approach to Feature-Based Opinion Mining
    Penalver-Martinez, Isidro
    Valencia-Garcia, Rafael
    Garcia-Sanchez, Francisco
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2011, 6716 : 193 - 200
  • [33] Ontology-guided approach to Feature-Based Opinion Mining
    Penalver-Martinez, Isidro
    Garcia-Sanchez, Francisco
    Garcia, Rafael Valencia
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2011, (46): : 91 - 98
  • [34] Predicting lncRNA-Protein Interactions With miRNAs as Mediators in a Heterogeneous Network Model
    Zhou, Yuan-Ke
    Shen, Zi-Ang
    Yu, Han
    Luo, Tao
    Gao, Yang
    Du, Pu-Feng
    FRONTIERS IN GENETICS, 2020, 10
  • [35] Text Mining: An Improvised Feature Based Model Approach
    Shivaprasad, K. M.
    Reddy, T. Hanumantha
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON APPLIED AND THEORETICAL COMPUTING AND COMMUNICATION TECHNOLOGY (ICATCCT), 2016, : 38 - 42
  • [36] Literature mining of host-pathogen interactions: comparing feature-based supervised learning and language-based approaches
    Thanh Thieu
    Joshi, Sneha
    Warren, Samantha
    Korkin, Dmitry
    BIOINFORMATICS, 2012, 28 (06) : 867 - 875
  • [37] Finding lncRNA-Protein Interactions Based on Deep Learning With Dual-Net Neural Architecture
    Peng, Lihong
    Wang, Chang
    Tian, Xiongfei
    Zhou, Liqian
    Li, Keqin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) : 3456 - 3468
  • [38] Feature-based Assessment of Text Readability
    Zhang, Lixiao
    Liu, Zaiying
    Ni, Jun
    2013 SEVENTH INTERNATIONAL CONFERENCE ON INTERNET COMPUTING FOR ENGINEERING AND SCIENCE (ICICSE 2013), 2013, : 51 - 54
  • [39] A feature-based approach to discrimination and prediction of protein folding
    Mirkin, B
    Ritter, O
    GENOMICS AND PROTEOMICS: FUNCTIONAL AND COMPUTATIONAL ASPECTS, 2000, : 157 - 177
  • [40] Predicting lncRNA-protein interactions using a hybrid deep learning model with dinucleotide-codon fusion feature encoding
    Li, Tan
    Li, Mengshan
    Fu, Yu
    Li, Yelin
    Zhu, Jihong
    Guan, Lixin
    BMC GENOMICS, 2024, 25 (01):