Sequence-Based Predicting Bacterial Essential ncRNAs Algorithm by Machine Learning

被引:0
|
作者
Ye, Yuan-Nong [1 ,2 ,3 ]
Liang, Ding-Fa [2 ]
Labena, Abraham Alemayehu [4 ]
Zeng, Zhu [2 ]
机构
[1] Guizhou Med Univ, Sch Big Hlth, Dept Med Informat, Bioinformat & Biomed Big data Min Lab, Guiyang 550025, Peoples R China
[2] Guizhou Med Univ, Cells & Antibody Engn Res Ctr Guizhou Prov, Sch Biol & Engn, Key Lab Biol & Med Engn, Guiyang 550025, Peoples R China
[3] Guizhou Med Univ, Key Lab Environm Pollut Monitoring & Dis Control, Minist Educ, Guiyang 550025, Peoples R China
[4] Dilla Univ, Coll Computat & Nat Sci, Dilla 419, Ethiopia
来源
基金
中国国家自然科学基金;
关键词
Bioinformatics; biological information theory; biomedical informatics; PROTEIN;
D O I
10.32604/iasc.2023.026761
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Essential ncRNA is a type of ncRNA which is indispensable for the sur-vival of organisms. Although essential ncRNAs cannot encode proteins, they are as important as essential coding genes in biology. They have got wide variety of applications such as antimicrobial target discovery, minimal genome construction and evolution analysis. At present, the number of species required for the deter-mination of essential ncRNAs in the whole genome scale is still very few due to the traditional methods are time-consuming, laborious and costly. In addition, tra-ditional experimental methods are limited by the organisms as less than 1% of bacteria can be cultured in the laboratory. Therefore, it is important and necessary to develop theories and methods for the recognition of essential non-coding RNA. In this paper, we present a novel method for predicting essential ncRNA by using both compositional and derivative features calculated by information theory of ncRNA sequences. The method was developed with Support Vector Machine (SVM). The accuracy of the method was evaluated through cross-species cross -vali-dation and found to be between 0.69 and 0.81. It shows that the features we selected have good performance for the prediction of essential ncRNA using SVM. Thus, the method can be applied for discovering essential ncRNAs in bacteria.
引用
收藏
页码:2731 / 2741
页数:11
相关论文
共 50 条
  • [21] Sequence-based prediction of protein protein interaction using a deep-learning algorithm
    Tanlin Sun
    Bo Zhou
    Luhua Lai
    Jianfeng Pei
    BMC Bioinformatics, 18
  • [22] EnContact: predicting enhancer-enhancer contacts using sequence-based deep learning model
    Gan, Mingxin
    Li, Wenran
    Jiang, Rui
    PEERJ, 2019, 7
  • [23] Sequence-based statistical downscaling and its application to hydrologic simulations based on machine learning and big data
    Wang, Qingrui
    Huang, Jing
    Liu, Ruimin
    Men, Cong
    Guo, Lijia
    Miao, Yuexi
    Jiao, Lijun
    Wang, Yifan
    Shoaib, Muhammad
    Xia, Xinghui
    JOURNAL OF HYDROLOGY, 2020, 586
  • [24] BPP: a sequence-based algorithm for branch point prediction
    Zhang, Qing
    Fan, Xiaodan
    Wang, Yejun
    Sun, Ming-an
    Shao, Jianlin
    Guo, Dianjing
    BIOINFORMATICS, 2017, 33 (20) : 3166 - 3172
  • [25] VacPred: Sequence-based prediction of plant vacuole proteins using machine-learning techniques
    Yadav, Arvind Kumar
    Singla, Deepak
    JOURNAL OF BIOSCIENCES, 2020, 45 (01)
  • [26] An Improved Sequence-based Indoor Localization Algorithm in WSNs
    Yu, Ying
    Yuan, Lingyun
    Kuang, Yulan
    2012 IEEE FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2012, : 923 - 926
  • [27] VacPred: Sequence-based prediction of plant vacuole proteins using machine-learning techniques
    Arvind Kumar Yadav
    Deepak Singla
    Journal of Biosciences, 2020, 45
  • [28] An Algorithm for Forward Reduction in Sequence-Based Software Specification
    Lin, Lan
    Xue, Yufeng
    Song, Fengguang
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2016, 26 (9-10) : 1431 - 1451
  • [29] Predicting Salmonella enterica serotypes by repetitive sequence-based PCR
    Wise, Mark G.
    Siragusa, Gregory R.
    Plumblee, Jodie
    Healy, Mimi
    Cray, Paula J.
    Seal, Bruce S.
    JOURNAL OF MICROBIOLOGICAL METHODS, 2009, 76 (01) : 18 - 24
  • [30] Enhanced Non-parametric Sequence-based Learning Algorithm for Outlier Detection in the Internet of Things
    Abel Efetobor Edje
    Shaffie Muhammad Abd Latiff
    Howe Weng Chan
    Neural Processing Letters, 2021, 53 : 1889 - 1919