Exploring Machine Learning Techniques to Improve Peptide Identification

被引:2
|
作者
Kirmani, Fawad [1 ]
Lane, Bryan Jeremy [1 ]
Rose, John R. [1 ]
机构
[1] Univ South Carolina, Dept Comp Sci & Engn, Columbia, SC 29208 USA
关键词
D O I
10.1109/BIBE.2019.00021
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Proteotypic peptides are the peptides in protein sequences that can be confidently observed by mass-spectrometry based proteomics. In recent years, there has been an increased effort to use proteotypic peptide prediction to improve the accuracy of peptide identification. These investigations compile various physicochemical peptide features to identify whether peptides are proteotypic. Here we describe our method for the selection, reduction and evaluation of physicochemical features for proteotypic peptide prediction. We performed feature selection on a published set of features and identified six features as the most significant. To highlight the effectiveness of our reduced feature set, we trained three machine learning algorithms (support vector machines, random forests, and XGBoost) as proteotypic peptide identifiers. Importantly, for larger data sets, the random forests and XGBoost algorithms trained faster than the support vector machine, as solving the support vector machine objective function requires quadratic programming. Our three classifiers had similar if not better prediction accuracy when compared to other proteotypic peptide predictors on the same data sets.
引用
收藏
页码:66 / 71
页数:6
相关论文
共 50 条
  • [1] Exploring Machine Learning Techniques for Identification of Cues for Robot Navigation with a LIDAR Scanner
    Bieszczad, Aj
    [J]. ICIMCO 2015 PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL. 1, 2015, : 645 - 652
  • [2] Exploring Machine Learning Techniques for Fault Localization
    Ascari, Luciano C.
    Araki, Lucilia Y.
    Pozo, Aurora R. T.
    Vergilio, Silvia R.
    [J]. LATW: 2009 10TH LATIN AMERICAN TEST WORKSHOP, 2009, : 37 - 42
  • [3] Sparse Fuzzy Techniques Improve Machine Learning
    Sanchez, Reinaldo
    Servin, Christian
    Argaez, Miguel
    [J]. PROCEEDINGS OF THE 2013 JOINT IFSA WORLD CONGRESS AND NAFIPS ANNUAL MEETING (IFSA/NAFIPS), 2013, : 531 - 535
  • [4] Machine learning for antimicrobial peptide identification and design
    Fangping Wan
    Felix Wong
    James J. Collins
    Cesar de la Fuente-Nunez
    [J]. Nature Reviews Bioengineering, 2024, 2 (5): : 392 - 407
  • [5] Exploring Machine Learning techniques for Smart Drainage System
    Chen, Changhua
    Pang, Yan
    [J]. 2019 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2019), 2019, : 63 - 70
  • [6] Exploring machine learning techniques for software size estimation
    Regolin, EN
    de Souza, GA
    Pozo, ART
    Vergilio, SR
    [J]. SCCC 2003: XXIII INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY, PROCEEDINGS, 2003, : 130 - 136
  • [7] Paraphrase Identification using Machine Learning Techniques
    Chitra, A.
    Kumar, C. S. Saravana
    [J]. RECENT ADVANCES IN NETWORKING, VLSI AND SIGNAL PROCESSING, 2010, : 245 - +
  • [8] Exploring Mode Identification in Irish Folk Music with Unsupervised Machine Learning and Template-Based Techniques
    Navarro-Caceres, Juan Jose
    Carvalho, Nadia
    Bernardes, Gilberto
    Jimenez-Bravo, Diego M.
    Navarro-Caceres, Maria
    [J]. MATHEMATICS AND COMPUTATION IN MUSIC, MCM 2024, 2024, 14639 : 412 - 420
  • [9] The identification and localization of speaker using fusion techniques and machine learning techniques
    Rasha H. Ali
    Mohammed Najm Abdullah
    Buthainah F. Abed
    [J]. Evolutionary Intelligence, 2024, 17 : 133 - 149
  • [10] The identification and localization of speaker using fusion techniques and machine learning techniques
    Ali, Rasha H.
    Abdullah, Mohammed Najm
    Abed, Buthainah F.
    [J]. EVOLUTIONARY INTELLIGENCE, 2024, 17 (01) : 133 - 149