Prediction and Identification of Kruppel-Like Transcription Factors by Machine Learning Method

被引:8
|
作者
Liao, Zhijun [1 ,2 ]
Wang, Xinrui [3 ]
Chen, Xingyong [4 ]
Zou, Quan [2 ,5 ]
机构
[1] Fujian Med Univ, Sch Basic Med Sci, Dept Biochem & Mol Biol, Fuzhou, Fujian, Peoples R China
[2] Tianjin Univ, Sch Comp Sci & Technol, Tianjin, Peoples R China
[3] Shanghai Jiao Tong Univ, Rui Jin Hosp, Sch Med, State Key Lab Med Genom,Shanghai Inst Hematol, Shanghai, Peoples R China
[4] Fujian Med Univ, Shengli Clin Coll, Fujian Prov Hosp, Dept Neurol, Fuzhou, Fujian, Peoples R China
[5] Nankai Univ, State Key Lab Med Chem Biol, Tianjin, Peoples R China
基金
中国国家自然科学基金;
关键词
Kruppel-like factor; binary-class classification; phylogenetic analysis; motif; a library for support vector machine; machine learning method; SEQUENCE-BASED PREDICTOR; WEB SERVER; SUBCELLULAR-LOCALIZATION; MYCOBACTERIAL PROTEINS; TUMOR-SUPPRESSOR; EXPRESSION; SITES; SOFTWARE; GENE; MODULATION;
D O I
10.2174/1386207320666170314094951
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Aim and Objective: The Kruppel-like factors (KLFs) are a family of containing Zn finger(ZF) motif transcription factors with 18 members in human genome, among them, KLF18 is predicted by bioinformatics. KLFs possess various physiological function involving in a number of cancers and other diseases. Here we perform a binary-class classification of KLFs and non-KLFs by machine learning methods. Material and Method: The protein sequences of KLFs and non-KLFs were searched from UniProt and randomly separate them into training dataset(containing positive and negative sequences) and test dataset(containing only negative sequences), after extracting the 188-dimensional(188D) feature vectors we carry out category with four classifiers(GBDT, libSVM, RF, and k-NN). On the human KLFs, we further dig into the evolutionary relationship and motif distribution, and finally we analyze the conserved amino acid residue of three zinc fingers. Results: The classifier model from training dataset were well constructed, and the highest specificity(Sp) was 99.83% from a library for support vector machine(libSVM) and all the correctly classified rates were over 70% for 10-fold cross-validation on test dataset. The 18 human KLFs can be further divided into 7 groups and the zinc finger domains were located at the carboxyl terminus, and many conserved amino acid residues including Cysteine and Histidine, and the span and interval between them were consistent in the three ZF domains. Conclusion: Two classification models for KLFs prediction have been built by novel machine learning methods.
引用
下载
收藏
页码:594 / 602
页数:9
相关论文
共 50 条
  • [1] Role of kruppel-like transcription factors in adipogenesis
    Wu, Zeni
    Wang, Suqing
    DEVELOPMENTAL BIOLOGY, 2013, 373 (02) : 235 - 243
  • [2] Kruppel-like transcription factors: A functional family
    Pearson, Richard
    Fleetwood, Jacqueline
    Eaton, Sally
    Crossley, Merlin
    Bao, Shisan
    INTERNATIONAL JOURNAL OF BIOCHEMISTRY & CELL BIOLOGY, 2008, 40 (10): : 1996 - 2001
  • [3] Kruppel-like transcription factors and control of pluripotency
    Bourillot, Pierre-Yves
    Savatier, Pierre
    BMC BIOLOGY, 2010, 8
  • [4] Role of Kruppel-like transcription factors in endothelial biology
    Atkins, G. Brandon
    Jain, Mukesh K.
    CIRCULATION RESEARCH, 2007, 100 (12) : 1686 - 1695
  • [5] Sp1-and Kruppel-like transcription factors
    Kaczynski, J
    Cook, T
    Urrutia, R
    GENOME BIOLOGY, 2003, 4 (02)
  • [6] The biology of the mammalian Kruppel-like family of transcription factors
    Dang, DT
    Pevsner, J
    Yang, VW
    INTERNATIONAL JOURNAL OF BIOCHEMISTRY & CELL BIOLOGY, 2000, 32 (11-12): : 1103 - 1121
  • [7] Vascular implications of the Kruppel-like family of transcription factors
    Suzuki, T
    Aizawa, K
    Matsumura, T
    Nagai, R
    ARTERIOSCLEROSIS THROMBOSIS AND VASCULAR BIOLOGY, 2005, 25 (06) : 1135 - 1141
  • [8] Kruppel-like factors in cancer
    Tetreault, Marie-Pier
    Yang, Yizeng
    Katz, Jonathan P.
    NATURE REVIEWS CANCER, 2013, 13 (10) : 701 - 713
  • [9] Kruppel-like Factors in Cancers
    Yang, Vincent W.
    BIOLOGY OF KRUPPEL-LIKE FACTORS, 2009, : 205 - 219
  • [10] Kruppel-like Factors in the Heart
    Kawanami, Daiji
    Haldar, Saptarsi M.
    Jain, Mukesh K.
    BIOLOGY OF KRUPPEL-LIKE FACTORS, 2009, : 159 - 171