Construction of precise support vector machine based models for predicting promoter strength

被引:0
|
作者
Hailin Meng [1 ]
Yingfei Ma [2 ]
Guoqin Mai [2 ]
Yong Wang [3 ]
Chenli Liu [1 ,2 ]
机构
[1] Bioengineering Research Center, Guangzhou Institute of Advanced Technology, Chinese Academy of Sciences
[2] Center for Synthetic Biology Engineering Research, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
[3] Chinese Academy of Sciences Key Laboratory of Synthetic Biology, Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of
关键词
D O I
暂无
中图分类号
Q811.4 [生物信息论];
学科分类号
0711 ; 0831 ;
摘要
Background: The prediction of the prokaryotic promoter strength based on its sequence is of great importance not only in the fundamental research of life sciences but also in the applied aspect of synthetic biology. Much advance has been made to build quantitative models for strength prediction, especially the introduction of machine learning methods such as artificial neural network(ANN) has significantly improve the prediction accuracy. As one of the most important machine learning methods, support vector machine(SVM) is more powerful to learn knowledge from small sample dataset and thus supposed to work in this problem.Methods: To confirm this, we constructed SVM based models to quantitatively predict the promoter strength. A library of 100 promoter sequences and strength values was randomly divided into two datasets, including a training set(≥10 sequences) for model training and a test set(≥10 sequences) for model test.Results: The results indicate that the prediction performance increases with an increase of the size of training set, and the best performance was achieved at the size of 90 sequences. After optimization of the model parameters, a highperformance model was finally trained, with a high squared correlation coefficient for fitting the training set(R2> 0.99) and the test set(R2> 0.98), both of which are better than that of ANN obtained by our previous work.Conclusions: Our results demonstrate the SVM-based models can be employed for the quantitative prediction of promoter strength.
引用
收藏
页码:90 / 98
页数:9
相关论文
共 50 条
  • [41] ViralmiR: a support-vector-machine-based method for predicting viral microRNA precursors
    Huang, Kai-Yao
    Lee, Tzong-Yi
    Teng, Yu-Chuan
    Chang, Tzu-Hao
    BMC BIOINFORMATICS, 2015, 16
  • [42] ViralmiR: a support-vector-machine-based method for predicting viral microRNA precursors
    Kai-Yao Huang
    Tzong-Yi Lee
    Yu-Chuan Teng
    Tzu-Hao Chang
    BMC Bioinformatics, 16
  • [43] A Support Vector Machine Based Approach for Predicting the Risk of Freshwater Disease Emergence in England
    Hassani, Hossein
    Silva, Emmanuel
    Combe, Marine
    Andreou, Demetra
    Ghodsi, Mansi
    Yeganegi, Mohammad
    Gozlan, Rodolphe
    STATS, 2019, 2 (01): : 89 - 103
  • [44] An improved Fuzzy Twin Support Vector Machine Based on Support Vector
    Wu Qing
    Sun Kaiyue
    2017 13TH IEEE CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2017, : 1130 - 1135
  • [45] IMPROVING STRUCTURE BASED MODELS FOR PREDICTING CHEMICAL FUNCTIONS AND WEIGHT FRACTIONS IN COSMETIC PRODUCTS USING ENSEMBLE SUPPORT VECTOR MACHINE
    Vojinovic, Tanja
    Kascelan, Ljiljana
    Potpara, Zorica
    Radonjic, Natasa
    Kascelan, Zorka
    FARMACIA, 2022, 70 (02) : 307 - 319
  • [46] Predicting subcellular location of apoptosis proteins based on wavelet transform and support vector machine
    Jian-Ding Qiu
    San-Hua Luo
    Jian-Hua Huang
    Xing-Yu Sun
    Ru-Ping Liang
    Amino Acids, 2010, 38 : 1201 - 1208
  • [47] Predicting corporate financial distress based on integration of support vector machine and logistic regression
    Hua, Zhongsheng
    Wang, Yu
    Xu, Xiaoyan
    Zhang, Bin
    Liang, Liang
    EXPERT SYSTEMS WITH APPLICATIONS, 2007, 33 (02) : 434 - 440
  • [48] Support vector machine: Classifying and predicting mutagenicity of complex mixtures based on pollution profiles
    Zheng, Weiwei
    Tian, Dajun
    Wang, Xia
    Tian, Weidong
    Zhang, Hao
    Jiang, Songhui
    He, Gengsheng
    Zheng, Yuxin
    Qu, Weidong
    TOXICOLOGY, 2013, 313 (2-3) : 151 - 159
  • [49] Predicting Hepatotoxicity of Drug Metabolites Via an Ensemble Approach Based on Support Vector Machine
    Lu, Yin
    Liu, Lili
    Lu, Dong
    Cai, Yudong
    Zheng, Mingyue
    Luo, Xiaomin
    Jiang, Hualiang
    Chen, Kaixian
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2017, 20 (10) : 839 - 849
  • [50] Predicting subcellular location of apoptosis proteins based on wavelet transform and support vector machine
    Qiu, Jian-Ding
    Luo, San-Hua
    Huang, Jian-Hua
    Sun, Xing-Yu
    Liang, Ru-Ping
    AMINO ACIDS, 2010, 38 (04) : 1201 - 1208