A text feature-based approach for literature mining of lncRNA-protein interactions

被引:15
|
作者
Li, Ao [1 ,2 ]
Zang, Qiguang [1 ]
Sun, Dongdong [1 ]
Wang, Minghui [1 ,2 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, 443 Huangshan Rd, Hefei 230027, Peoples R China
[2] Univ Sci & Technol China, Ctr Biomed Engn, 443 Huangshan Rd, Hefei 230027, Peoples R China
基金
中国国家自然科学基金;
关键词
LncRNA-protein interaction; Text mining; Text features; Machine learning; NONCODING RNAS; DATABASE;
D O I
10.1016/j.neucom.2015.11.110
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Long non-coding RNAs (lncRNAs) play important roles in regulating transcriptional and post transcriptional levels. Currently, Knowledge of lncRNA and protein interactions (LPIs) is crucial for biomedical researches that are related to lncRNA. Many freshly discovered LPIs are stored in biomedical literature. With over one million new biomedical journal articles published every year, just keeping up with the novel finding requires automatically extracting information by text mining. To address this issue, we apply a text feature-based text mining approach to efficiently extract LPIs from biomedical literatures. Our approach consists of four steps. By employ natural language processing (NLP) technologies, this approach extracts text features from sentences that can precisely reflect the real LPIs. Our approach involves four steps including data collection, text pre-processing, structured representation, features extraction and training model and classification. The F-score performance of our approach achieves 79.5%, and the results indicate that the proposed approach can efficiently extract LPIs from biomedical literature. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:73 / 80
页数:8
相关论文
共 50 条
  • [41] Analysis of protein/protein interactions through biomedical literature: Text mining of abstracts vs. text mining of full text articles
    Martin, EPG
    Bremer, EG
    Guerin, MC
    DeSesa, C
    Jouve, O
    KNOWLEDGE EXPLORATION IN LIFE SCIENCE INFORMATICS, PROCEEDINGS, 2004, 3303 : 96 - 108
  • [42] Constructing discriminative feature space for LncRNA-protein interaction based on deep autoencoder and marginal fisher analysis
    Teng, Zhixia
    Zhang, Yiran
    Dai, Qiguo
    Wu, Chengyan
    Li, Dan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 157
  • [43] Feature-based opinion mining and ranking
    Eirinaki, Magdalini
    Pisal, Shamita
    Singh, Japinder
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2012, 78 (04) : 1175 - 1184
  • [44] Feature-Based Opinion Mining Approach (FOMA) for Improved Book Recommendation
    Shahab Saquib Sohail
    Jamshed Siddiqui
    Rashid Ali
    Arabian Journal for Science and Engineering, 2018, 43 : 8029 - 8048
  • [45] Feature-Based Opinion Mining Approach (FOMA) for Improved Book Recommendation
    Sohail, Shahab Saquib
    Siddiqui, Jamshed
    Ali, Rashid
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2018, 43 (12) : 8029 - 8048
  • [46] Fusion of multi-source relationships and topology to infer lncRNA-protein interactions
    Zhang, Xinyu
    Liu, Mingzhe
    Li, Zhen
    Zhuo, Linlin
    Fu, Xiangzheng
    Zou, Quan
    MOLECULAR THERAPY NUCLEIC ACIDS, 2024, 35 (02):
  • [47] Multi-feature fusion for deep learning to predict plant lncRNA-protein interaction
    Wekesa, Jael Sanyanda
    Meng, Jun
    Luan, Yushi
    GENOMICS, 2020, 112 (05) : 2928 - 2936
  • [48] Fusing multiple protein-protein similarity networks to effectively predict lncRNA-protein interactions
    Xiaoxiong Zheng
    Yang Wang
    Kai Tian
    Jiaogen Zhou
    Jihong Guan
    Libo Luo
    Shuigeng Zhou
    BMC Bioinformatics, 18
  • [49] Fusing multiple protein-protein similarity networks to effectively predict lncRNA-protein interactions
    Zheng, Xiaoxiong
    Wang, Yang
    Tian, Kai
    Zhou, Jiaogen
    Guan, Jihong
    Luo, Libo
    Zhou, Shuigeng
    BMC BIOINFORMATICS, 2017, 18
  • [50] LPI-IBWA: Predicting lncRNA-protein interactions based on an improved Bi-Random walk
    Xie, Minzhu
    Xie, Ruijie
    Wang, Hao
    METHODS, 2023, 220 : 98 - 105