Improved method for predicting protein fold patterns with ensemble classifiers

被引:21
|
作者
Chen, W. [1 ]
Liu, X. [1 ,3 ,4 ]
Huang, Y. [2 ]
Jiang, Y. [1 ]
Zou, Q. [1 ]
Lin, C. [1 ]
机构
[1] Xiamen Univ, Sch Informat Sci & Technol, Xiamen, Fujian, Peoples R China
[2] Henan Univ Sci & Technol, Anim Sci & Technol Coll, Luoyang, Henan, Peoples R China
[3] Xiamen Univ, Shenzhen Res Inst, Guangzhou, Guangdong, Peoples R China
[4] Dalian Univ, Minist Educ, Key Lab Adv Design & Intelligent Comp, Dalian, Peoples R China
来源
GENETICS AND MOLECULAR RESEARCH | 2012年 / 11卷 / 01期
关键词
Protein folding pattern; Ensemble classifier; Machine learning; Bioinformatics; CLASSIFICATION; DATABASE;
D O I
10.4238/2012.January.27.4
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Protein folding is recognized as a critical problem in the field of biophysics in the 21st century. Predicting protein-folding patterns is challenging due to the complex structure of proteins. In an attempt to solve this problem, we employed ensemble classifiers to improve prediction accuracy. In our experiments, 188-dimensional features were extracted based on the composition and physical-chemical property of proteins and 20-dimensional features were selected using a coupled position-specific scoring matrix. Compared with traditional prediction methods, these methods were superior in terms of prediction accuracy. The 188-dimensional feature-based method achieved 71.2% accuracy in five cross-validations. The accuracy rose to 77% when we used a 20-dimensional feature vector. These methods were used on recent data, with 54.2% accuracy. Source codes and dataset, together with web server and software tools for prediction, are available at: http://datamining.xmu.edu.cn/main/similar to cwc/ProteinPredict.html.
引用
收藏
页码:174 / 181
页数:8
相关论文
共 50 条
  • [1] Ensemble of classifiers for protein fold recognition
    Nanni, L
    NEUROCOMPUTING, 2006, 69 (7-9) : 850 - 853
  • [2] A novel ensemble of classifiers for protein fold recognition
    Nanni, Loris
    NEUROCOMPUTING, 2006, 69 (16-18) : 2434 - 2437
  • [3] Protein Fold Prediction Problem Using Ensemble of Classifiers
    Dehzangi, Abdollah
    Amnuaisuk, Somnuk Phon
    Ng, Keng Hoong
    Mohandesi, Ehsan
    NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2009, 5864 : 503 - 511
  • [4] Predicting Protein-Protein Interactions based on ensemble classifiers
    Zhou, Zheng-Rong
    Song, Xiao-Feng
    Wang, Ming-Hao
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2010, 38 (06): : 1464 - 1467
  • [5] Ensemble Classifiers with Improved Overfitting
    Pourtaheri, Zeinab Khatoun
    Zahiri, Seyed Hamid
    2016 1ST CONFERENCE ON SWARM INTELLIGENCE AND EVOLUTIONARY COMPUTATION (CSIEC 2016), 2016, : 93 - 97
  • [6] Comparing ensemble learning methods based on decision tree classifiers for protein fold recognition
    Bardsiri, Mahshid Khatibi
    Eftekhari, Mahdi
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2014, 9 (01) : 89 - 105
  • [7] Fusion of classifiers for protein fold recognition
    Nanni, L
    NEUROCOMPUTING, 2005, 68 : 315 - 321
  • [8] Improved multi-label classifiers for predicting protein subcellular localization
    Chen, Lei
    Qu, Ruyun
    Liu, Xintong
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2024, 21 (01) : 214 - 236
  • [9] An Ensemble Classifiers for Improved Prediction of Native-Non-Native Protein-Protein Interaction
    Pratiwi, Nor Kumalasari Caecar
    Tayara, Hilal
    Chong, Kil To
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (11)
  • [10] The effectiveness of ensemble learning based on four different classifiers for predicting membrane protein types
    Li, Mingyuan
    Wang, Shunfang
    Guo, Lei
    PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 5829 - 5834