Sequence-Based Predicting Bacterial Essential ncRNAs Algorithm by Machine Learning

被引:0
|
作者
Ye, Yuan-Nong [1 ,2 ,3 ]
Liang, Ding-Fa [2 ]
Labena, Abraham Alemayehu [4 ]
Zeng, Zhu [2 ]
机构
[1] Guizhou Med Univ, Sch Big Hlth, Dept Med Informat, Bioinformat & Biomed Big data Min Lab, Guiyang 550025, Peoples R China
[2] Guizhou Med Univ, Cells & Antibody Engn Res Ctr Guizhou Prov, Sch Biol & Engn, Key Lab Biol & Med Engn, Guiyang 550025, Peoples R China
[3] Guizhou Med Univ, Key Lab Environm Pollut Monitoring & Dis Control, Minist Educ, Guiyang 550025, Peoples R China
[4] Dilla Univ, Coll Computat & Nat Sci, Dilla 419, Ethiopia
来源
基金
中国国家自然科学基金;
关键词
Bioinformatics; biological information theory; biomedical informatics; PROTEIN;
D O I
10.32604/iasc.2023.026761
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Essential ncRNA is a type of ncRNA which is indispensable for the sur-vival of organisms. Although essential ncRNAs cannot encode proteins, they are as important as essential coding genes in biology. They have got wide variety of applications such as antimicrobial target discovery, minimal genome construction and evolution analysis. At present, the number of species required for the deter-mination of essential ncRNAs in the whole genome scale is still very few due to the traditional methods are time-consuming, laborious and costly. In addition, tra-ditional experimental methods are limited by the organisms as less than 1% of bacteria can be cultured in the laboratory. Therefore, it is important and necessary to develop theories and methods for the recognition of essential non-coding RNA. In this paper, we present a novel method for predicting essential ncRNA by using both compositional and derivative features calculated by information theory of ncRNA sequences. The method was developed with Support Vector Machine (SVM). The accuracy of the method was evaluated through cross-species cross -vali-dation and found to be between 0.69 and 0.81. It shows that the features we selected have good performance for the prediction of essential ncRNA using SVM. Thus, the method can be applied for discovering essential ncRNAs in bacteria.
引用
收藏
页码:2731 / 2741
页数:11
相关论文
共 50 条
  • [31] Enhanced Non-parametric Sequence-based Learning Algorithm for Outlier Detection in the Internet of Things
    Edje, Abel Efetobor
    Abd Latiff, Shaffie Muhammad
    Chan, Howe Weng
    NEURAL PROCESSING LETTERS, 2021, 53 (03) : 1889 - 1919
  • [32] ATLAS: A Sequence-based Learning Approach for Attack Investigation
    Alsaheel, Abdulellah
    Nan, Yuhong
    Ma, Shiqing
    Yu, Le
    Walkup, Gregory
    Celik, Z. Berkay
    Zhang, Xiangyu
    Xu, Dongyan
    PROCEEDINGS OF THE 30TH USENIX SECURITY SYMPOSIUM, 2021, : 3005 - 3022
  • [33] Establishment of a model for predicting preterm birth based on the machine learning algorithm
    Yao Zhang
    Sisi Du
    Tingting Hu
    Shichao Xu
    Hongmei Lu
    Chunyan Xu
    Jufang Li
    Xiaoling Zhu
    BMC Pregnancy and Childbirth, 23
  • [34] Predicting flow stress of Ni steel based on machine learning algorithm
    Cao, Guang-Ming
    Gao, Zhi-Wei
    Gao, Xin-Yu
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2022, 236 (08) : 4253 - 4266
  • [35] Establishment of a model for predicting preterm birth based on the machine learning algorithm
    Zhang, Yao
    Du, Sisi
    Hu, Tingting
    Xu, Shichao
    Lu, Hongmei
    Xu, Chunyan
    Li, Jufang
    Zhu, Xiaoling
    BMC PREGNANCY AND CHILDBIRTH, 2023, 23 (01)
  • [36] Sequence-based imitation learning for surgical robot operations
    Furnari, Gabriele
    Secchi, Cristian
    Ferraguti, Federica
    ARTIFICIAL INTELLIGENCE SURGERY, 2025, 5 (01): : 103 - 115
  • [37] A new sequence optimization algorithm based on particle swarm for machine learning
    Xie, Chaofan
    Zhang, Fuquan
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 13 (5) : 2601 - 2619
  • [38] A Sequence-Based Machine Comprehension Modeling Using LSTM and GRU
    Viswanathan, Sujith
    Kumar, M. Anand
    Soman, K. P.
    EMERGING RESEARCH IN ELECTRONICS, COMPUTER SCIENCE AND TECHNOLOGY, ICERECT 2018, 2019, 545 : 46 - 54
  • [39] Classification of multi-family enzymes by multi-label machine learning and sequence-based descriptors
    Wang, Yuelong
    Jing, Runyu
    Hua, Yongpan
    Fu, Yuanyuan
    Dai, Xu
    Huang, Liqiu
    Li, Menglong
    ANALYTICAL METHODS, 2014, 6 (17) : 6832 - 6840
  • [40] Sequence-Based Machine Learning Reveals 3D Genome Differences between Bonobos and Chimpanzees
    Brand, Colin M.
    Kuang, Shuzhen
    Gilbertson, Erin N.
    McArthur, Evonne
    Pollard, Katherine S.
    Webster, Timothy H.
    Capra, John A.
    GENOME BIOLOGY AND EVOLUTION, 2024, 16 (11):