A text feature-based approach for literature mining of lncRNA-protein interactions

Cited by: 15
Authors
Li, Ao [1 ,2 ]
Zang, Qiguang [1 ]
Sun, Dongdong [1 ]
Wang, Minghui [1 ,2 ]
Affiliations
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, 443 Huangshan Rd, Hefei 230027, Peoples R China
[2] Univ Sci & Technol China, Ctr Biomed Engn, 443 Huangshan Rd, Hefei 230027, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
LncRNA-protein interaction; Text mining; Text features; Machine learning; NONCODING RNAS; DATABASE;
DOI
10.1016/j.neucom.2015.11.110
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Long non-coding RNAs (lncRNAs) play important roles in regulation at the transcriptional and post-transcriptional levels. Currently, knowledge of lncRNA-protein interactions (LPIs) is crucial for lncRNA-related biomedical research. Many newly discovered LPIs are recorded in the biomedical literature. With over one million new biomedical journal articles published every year, keeping up with novel findings requires automatic information extraction by text mining. To address this issue, we apply a text feature-based text mining approach to efficiently extract LPIs from the biomedical literature. By employing natural language processing (NLP) technologies, this approach extracts text features from sentences that can precisely reflect real LPIs. Our approach involves four steps, including data collection, text pre-processing, structured representation, feature extraction, and model training and classification. The approach achieves an F-score of 79.5%, and the results indicate that it can efficiently extract LPIs from biomedical literature. (C) 2016 Elsevier B.V. All rights reserved.
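As a rough illustration of what sentence-level text features and the F-score metric might look like, here is a minimal Python sketch. It is not the authors' implementation: the trigger-word list, the feature names, and the helper functions `extract_features` and `f_score` are all assumptions made for this example, since the abstract does not specify the actual feature set or classifier.

```python
import re

# Hypothetical trigger words; the abstract does not list the real features.
TRIGGERS = {"binds", "bind", "interacts", "interact", "recruits", "associates"}
NEGATIONS = {"not", "no", "neither"}


def extract_features(sentence, lncrna, protein):
    """Build a small feature dict for one (lncRNA, protein) pair in a sentence.

    Assumes both entity names occur in the sentence as single tokens.
    """
    tokens = [t.lower() for t in re.findall(r"[A-Za-z0-9\-]+", sentence)]
    i = tokens.index(lncrna.lower())
    j = tokens.index(protein.lower())
    lo, hi = sorted((i, j))
    between = tokens[lo + 1:hi]
    return {
        # Number of tokens separating the two entity mentions.
        "token_distance": hi - lo - 1,
        # Whether an interaction-like word appears between the entities.
        "has_trigger_between": any(t in TRIGGERS for t in between),
        # Whether the sentence contains a simple negation cue.
        "has_negation": any(t in NEGATIONS for t in tokens),
    }


def f_score(gold, predicted):
    """Balanced F-score: harmonic mean of precision and recall."""
    tp = sum(1 for g, p in zip(gold, predicted) if g and p)
    fp = sum(1 for g, p in zip(gold, predicted) if not g and p)
    fn = sum(1 for g, p in zip(gold, predicted) if g and not p)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


features = extract_features(
    "HOTAIR binds EZH2 in breast cancer cells.", "HOTAIR", "EZH2"
)
score = f_score([True, True, False, False], [True, False, True, False])
```

In a full pipeline, feature dicts like these would be vectorized and passed to a trained classifier, and `f_score` would be computed on held-out labeled sentences.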
Pages: 73-80
Page count: 8