A new taxonomy-based protein fold recognition approach based on autocross-covariance transformation

被引:164
|
作者
Dong, Qiwen [1 ,2 ]
Zhou, Shuigeng [1 ,2 ]
Guan, Jihong [3 ]
机构
[1] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Shanghai 200433, Peoples R China
[2] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
[3] Tongji Univ, Dept Comp Sci & Technol, Shanghai 200092, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
REMOTE HOMOLOGY DETECTION; STRUCTURE PREDICTION; ENSEMBLE CLASSIFIER; SECONDARY STRUCTURE; COUPLED RECEPTORS; INFORMATION; SEQUENCES; IDENTIFICATION; PROFILES; DATABASE;
D O I
10.1093/bioinformatics/btp500
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Fold recognition is an important step in protein structure and function prediction. Traditional sequence comparison methods fail to identify reliable homologies with low sequence identity, while the taxonomic methods are effective alternatives, but their prediction accuracies are around 70%, which are still relatively low for practical usage. Results: In this study, a simple and powerful method is presented for taxonomic fold recognition, which combines support vector machine (SVM) with autocross-covariance (ACC) transformation. The evolutionary information represented in the form of position-specific score matrices is converted into a series of fixed-length vectors by ACC transformation and these vectors are then input to a SVM classifier for fold recognition. The sequence-order effect can be effectively captured by this scheme. Experiments are performed on the widely used D-B dataset and the corresponding extended dataset, respectively. The proposed method, called ACCFold, gets an overall accuracy of 70.1% on the D-B dataset, which is higher than major existing taxonomic methods by 2-14%. Furthermore, the method achieves an overall accuracy of 87.6% on the extended dataset, which surpasses major existing taxonomic methods by 9-17%. Additionally, our method obtains an overall accuracy of 80.9% for 86-folds and 77.2% for 199-folds. These results demonstrate that the ACCFold method provides the state-of-the-art performance for taxonomic fold recognition.
引用
收藏
页码:2655 / 2662
页数:8
相关论文
共 50 条
  • [1] Improving taxonomy-based protein fold recognition by using global and local features
    Yang, Jian-Yi
    Chen, Xin
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2011, 79 (07) : 2053 - 2064
  • [2] Predicting G-protein coupled receptors-G-protein coupling specificity based on autocross-covariance transform
    Guo, Yanzhi
    Li, Menglong
    Lu, Minchun
    Wen, Zhining
    Huang, Zhongtian
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 65 (01) : 55 - 60
  • [3] The Web Knowledge Management: A Taxonomy-Based Approach
    Pani, Filippo Eros
    Lunesu, Maria Ilaria
    Concas, Giulio
    Baralla, Gavina
    [J]. KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT, IC3K 2013, 2015, 454 : 230 - 244
  • [4] Fish Ontology framework for taxonomy-based fish recognition
    Ali, Najib M.
    Khan, Haris A.
    Then, Amy Y-Hui
    Ching, Chong Ving
    Gaur, Manas
    Dhillon, Sarinder Kaur
    [J]. PEERJ, 2017, 5
  • [5] A new approach to protein fold recognition based on Delaunay tessellation of protein structure
    Zheng, W
    Cho, SJ
    Vaisman, II
    Tropsha, A
    [J]. PACIFIC SYMPOSIUM ON BIOCOMPUTING '97, 1996, : 486 - 497
  • [6] Human Action Recognition: A Taxonomy-Based Survey, Updates, and Opportunities
    Morshed, Md Golam
    Sultana, Tangina
    Alam, Aftab
    Lee, Young-Koo
    [J]. SENSORS, 2023, 23 (04)
  • [7] A new approach to protein fold recognition based on delaunay tessellation of protein structure.
    Tropsha, A
    Vaisman, II
    Cho, SJ
    Zheng, W
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1996, 212 : 71 - COMP
  • [8] A NEW APPROACH TO PROTEIN FOLD RECOGNITION
    JONES, DT
    TAYLOR, WR
    THORNTON, JM
    [J]. NATURE, 1992, 358 (6381) : 86 - 89
  • [9] A Taxonomy-based Approach for Security in Software-Defined Networking
    Banse, Christian
    Schuette, Julian
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2017,
  • [10] A taxonomy-based approach to determining generic interestingness of association rules
    Natarajan, R
    Shekar, B
    [J]. IEEE TENCON 2003: CONFERENCE ON CONVERGENT TECHNOLOGIES FOR THE ASIA-PACIFIC REGION, VOLS 1-4, 2003, : 703 - 707