Protein Function Prediction Using Decision Trees

被引:2
|
作者
Yedida, Venkata Rama Kumar Swamy [1 ]
Chan, Chien-Chung [1 ]
Duan, Zhong-Hui [1 ]
机构
[1] Univ Akron, Dept Comp Sci, Akron, OH 44236 USA
关键词
D O I
10.1109/BIBMW.2008.4686235
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
We present an automated protein function prediction system that is based on a set of homologous proteins and gene ontology categories. A novel measure based on a set of optimal local alignments is used to identify the homologues. The biological functions of the homologous proteins are characterized with gene ontology annotations. The protein function prediction is performed based on data mining models using decision trees. The tree models depict the interconnections between biological functional groups, which reflect, in certain degree, the underlying biological pathways. The models are trained and tested using the complete proteome of model organism yeast (Sacchyromyces cerevisiae). The results of this study demonstrate the variations of model accuracy and prediction accuracy from one functional group to another. The variations illustrate certain limitations of sequence similarity based protein function prediction methods. However, basic assumption of similar sequences resulting similar functions is still largely valid The models developed outperform the methods that are solely depends on the annotations of homologous proteins, although the model is to be used as a preliminary tool for protein function prediction and the prediction results need to be verified through other means. The results show that the prediction accuracies for most of the functional groups are over 80%.
引用
收藏
页码:193 / 199
页数:7
相关论文
共 50 条
  • [1] Conformal Prediction Using Decision Trees
    Johansson, Ulf
    Bostrom, Henrik
    Lofstrom, Tuve
    [J]. 2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 330 - 339
  • [2] Human Protein Function Prediction using Decision Tree Induction
    Singh, Manpreet
    Wadhwa, Parminder Kaur
    Sandhu, Parvinder Singh
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2007, 7 (04): : 92 - 98
  • [3] Business Failure Prediction using Decision Trees
    Gepp, Adrian
    Kumar, Kuldeep
    Bhattacharya, Sukanto
    [J]. JOURNAL OF FORECASTING, 2010, 29 (06) : 536 - 555
  • [4] Hard drive failure prediction using Decision Trees
    Li, Jing
    Stones, Rebecca J.
    Wang, Gang
    Liu, Xiaoguang
    Li, Zhongwei
    Xu, Ming
    [J]. RELIABILITY ENGINEERING & SYSTEM SAFETY, 2017, 164 : 55 - 65
  • [5] Prediction and query evaluation using linguistic decision trees
    Qin, Zengchang
    Lawry, Jonathan
    [J]. APPLIED SOFT COMPUTING, 2011, 11 (05) : 3916 - 3928
  • [6] Protein cellular localization prediction with support vector machines and decision trees
    Lorena, Ana Carolina
    de Carvalho, Andre C. P. L. F.
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2007, 37 (02) : 115 - 125
  • [7] Prediction of back break in blasting using random decision trees
    Kumar, Shankar
    Mishra, A. K.
    Choudhary, B. S.
    [J]. ENGINEERING WITH COMPUTERS, 2022, 38 (SUPPL 2) : 1185 - 1191
  • [8] Intracranial hypertension prediction using extremely randomized decision trees
    Scalzo, Fabien
    Hamilton, Robert
    Asgari, Shadnaz
    Kim, Sunghan
    Hu, Xiao
    [J]. MEDICAL ENGINEERING & PHYSICS, 2012, 34 (08) : 1058 - 1065
  • [9] A Medical Price Prediction System using Hierarchical Decision Trees
    Tike, Anuja
    Tavarageri, Sanket
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 3904 - 3913
  • [10] Using decision trees for agent modeling: Improving prediction performance
    Chiu, BC
    Webb, GI
    [J]. USER MODELING AND USER-ADAPTED INTERACTION, 1998, 8 (1-2) : 131 - 152