Predicting protein secondary structure by an ensemble through feature-based accuracy estimation

Cited by: 0
Authors
Krieger, Spencer [1]
Kececioglu, John [1]
Affiliation
[1] Univ Arizona, Comp Sci, Tucson, AZ 85721 USA
Funding
National Science Foundation (USA)
Keywords
Protein secondary structure prediction; ensemble methods; feature-based accuracy estimation; method hybridization; neural networks
DOI
10.1145/3388440.3412425
Chinese Library Classification
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Protein secondary structure prediction is a fundamental task in computational biology, basic to many bioinformatics workflows, with a diverse collection of tools currently available. An approach from machine learning with the potential to capitalize on such a collection is ensemble prediction, which runs multiple predictors and combines their predictions into a single prediction that the ensemble outputs. We conduct a thorough study of seven different approaches to ensemble secondary structure prediction, several of which are novel, and show we can indeed obtain an ensemble method that significantly exceeds the accuracy of individual state-of-the-art tools. The best approaches build on a recent technique known as feature-based accuracy estimation, which estimates the unknown true accuracy of a prediction, here using features of both the prediction output and the internal state of the prediction method. In particular, a hybrid approach to ensemble prediction that leverages accuracy estimation is the most accurate method currently available: on average over standard CASP and PDB benchmarks, it exceeds the state-of-the-art Q3 accuracy for 3-state prediction by nearly 4%, and exceeds the Q8 accuracy for 8-state prediction by more than 8%. A preliminary implementation of our approach to ensemble protein secondary structure prediction, in a new tool we call Ssylla, is available free for non-commercial use at ssylla.cs.arizona.edu.
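To make the terms in the abstract concrete, the following is a minimal sketch of Q3 accuracy (the fraction of residues whose 3-state label is predicted correctly) and of one simple way to combine several predictors. It is not the method implemented in Ssylla, which the abstract does not specify in detail. The function names `q3_accuracy` and `weighted_vote`, the H/E/C label alphabet, and the use of a single estimated-accuracy weight per predictor are all assumptions made for illustration.

```python
from typing import Dict, Sequence


def q3_accuracy(predicted: str, true: str) -> float:
    """Fraction of residues whose predicted 3-state label matches the true one."""
    if len(predicted) != len(true):
        raise ValueError("predicted and true strings must have equal length")
    if not true:
        raise ValueError("sequences must be non-empty")
    matches = sum(p == t for p, t in zip(predicted, true))
    return matches / len(true)


def weighted_vote(predictions: Sequence[str], weights: Sequence[float]) -> str:
    """Combine predictions by a per-residue weighted vote.

    Each weight is a hypothetical stand-in for the estimated accuracy of the
    corresponding predictor; a real accuracy estimator would supply it.
    """
    if not predictions or len(predictions) != len(weights):
        raise ValueError("need one weight per prediction")
    length = len(predictions[0])
    if any(len(p) != length for p in predictions):
        raise ValueError("all predictions must have equal length")

    combined = []
    for i in range(length):
        scores: Dict[str, float] = {}
        for pred, weight in zip(predictions, weights):
            scores[pred[i]] = scores.get(pred[i], 0.0) + weight
        # Iterate labels in sorted order so ties resolve deterministically.
        combined.append(max(sorted(scores), key=lambda label: scores[label]))
    return "".join(combined)


if __name__ == "__main__":
    true_labels = "HHHHCCEEEE"  # toy example, not real benchmark data
    preds = ["HHHHCCEEEC", "HHHCCCEEEE", "HHCHCCCEEE"]
    weights = [0.9, 0.9, 0.8]  # hypothetical estimated accuracies
    ensemble = weighted_vote(preds, weights)
    print(ensemble, q3_accuracy(ensemble, true_labels))
```

In this toy example each predictor errs at a different position, so the vote recovers labels that no single predictor gets entirely right. Q8 accuracy would be computed the same way over an 8-letter label alphabet.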
Pages: 10