Extracting features from protein sequences to improve deep extreme learning machine for protein fold recognition

被引:14
|
作者
Ibrahim, Wisam [1 ]
Abadeh, Mohammad Saniee [1 ]
机构
[1] Tarbiat Modares Univ, Fac Elect & Comp Engn, Tehran, Iran
关键词
Protein fold recognition; Extreme learning machine; Protein descriptor; Feature extraction; AMINO-ACID-COMPOSITION; LYSINE SUCCINYLATION SITES; GENERAL-FORM; ENSEMBLE CLASSIFIER; STRUCTURAL CLASSES; SUBCELLULAR-LOCALIZATION; PSEUDO COMPONENTS; DIFFERENT MODES; K-TUPLE; PREDICTION;
D O I
10.1016/j.jtbi.2017.03.023
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Protein fold recognition is an important problem in bioinformatics to predict three-dimensional structure of a protein. One of the most challenging tasks in protein fold recognition problem is the extraction of efficient features from the amino-acid sequences to obtain better classifiers. In this paper, we have proposed six descriptors to extract features from protein sequences. These descriptors are applied in the first stage of a three-stage framework PCA-DELM-LDA to extract feature vectors from the amino-acid sequences. Principal Component Analysis PCA has been implemented to reduce the number of extracted features. The extracted feature vectors have been used with original features to improve the performance of the Deep Extreme Learning Machine DELM in the second stage. Four new features have been extracted from the second stage and used in the third stage by Linear Discriminant Analysis LDA to classify the instances into 27 folds. The proposed framework is implemented on the independent and combined feature sets in SCOP datasets. The experimental results show that extracted feature vectors in the first stage could improve the performance of DELM in extracting new useful features in second stage. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [1] Learning Protein Embedding to Improve Protein Fold Recognition Using Deep Metric Learning
    Zhu, Guan-Yu
    Liu, Yan
    Wang, Peng-Hao
    Yang, Xibei
    Yu, Dong-Jun
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (17) : 4283 - 4291
  • [2] Protein fold recognition using Deep Kernelized Extreme Learning Machine and linear discriminant analysis
    Ibrahim, Wisam
    Abadeh, Mohammad Saniee
    [J]. NEURAL COMPUTING & APPLICATIONS, 2019, 31 (08): : 4201 - 4214
  • [3] Protein fold recognition using Deep Kernelized Extreme Learning Machine and linear discriminant analysis
    Wisam Ibrahim
    Mohammad Saniee Abadeh
    [J]. Neural Computing and Applications, 2019, 31 : 4201 - 4214
  • [4] Improving Protein Fold Recognition by Deep Learning Networks
    Jo, Taeho
    Hou, Jie
    Eickholt, Jesse
    Cheng, Jianlin
    [J]. SCIENTIFIC REPORTS, 2015, 5
  • [5] Improving Protein Fold Recognition by Deep Learning Networks
    Taeho Jo
    Jie Hou
    Jesse Eickholt
    Jianlin Cheng
    [J]. Scientific Reports, 5
  • [6] New techniques for extracting features from protein sequences
    Wang, JTL
    Ma, Q
    Shasha, D
    Wu, CH
    [J]. IBM SYSTEMS JOURNAL, 2001, 40 (02) : 426 - 441
  • [7] Extracting Coevolutionary Features from Protein Sequences for Predicting Protein-Protein Interactions
    Hu, Lun
    Chan, Keith C. C.
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2017, 14 (01) : 155 - 166
  • [8] Improving protein fold recognition by extracting fold-specific features from predicted residue-residue contacts
    Zhu, Jianwei
    Zhang, Haicang
    Li, Shuai Cheng
    Wang, Chao
    Kong, Lupeng
    Sun, Shiwei
    Zheng, Wei-Mou
    Bu, Dongbo
    [J]. BIOINFORMATICS, 2017, 33 (23) : 3749 - 3757
  • [9] A machine learning information retrieval approach to protein fold recognition
    Cheng, Jianlin
    Baldi, Pierre
    [J]. BIOINFORMATICS, 2006, 22 (12) : 1456 - 1463
  • [10] Relevance of Machine Learning Techniques and Various Protein Features in Protein Fold Classification: A Review
    Patil, Komal
    Chouhan, Usha
    [J]. CURRENT BIOINFORMATICS, 2019, 14 (08) : 688 - 697