Prediction of RNA-Binding Proteins by Voting Systems

被引:20
|
作者
Peng, C. R. [1 ,2 ]
Liu, L. [2 ]
Niu, B. [3 ]
Lv, Y. L. [4 ]
Li, M. J. [1 ]
Yuan, Y. L. [5 ]
Zhu, Y. B. [1 ]
Lu, W. C. [1 ]
Cai, Y. D. [6 ]
机构
[1] Shanghai Univ, Dept Chem, Coll Sci, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Sch Mat Sci & Engn, Shanghai 2000721, Peoples R China
[3] Shanghai Univ, Coll Life Sci, Shanghai 200444, Peoples R China
[4] Univ Shanghai Sci & Technol Lib, Shanghai 200093, Peoples R China
[5] WuXi AppTec Co Ltd, Dept Synth, Shanghai 200131, Peoples R China
[6] Shanghai Univ, Inst Syst Biol, Shanghai 200444, Peoples R China
基金
中国国家自然科学基金;
关键词
SUPPORT VECTOR MACHINES; AMINO-ACID-SEQUENCE; FEATURE-SELECTION; STRUCTURAL CLASSES; SITES; DNA; SERVER;
D O I
10.1155/2011/506205
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
It is important to identify which proteins can interact with RNA for the purpose of protein annotation, since interactions between RNA and proteins influence the structure of the ribosome and play important roles in gene expression. This paper tries to identify proteins that can interact with RNA using voting systems. Firstly through Weka, 34 learning algorithms are chosen for investigation. Then simple majority voting system (SMVS) is used for the prediction of RNA-binding proteins, achieving average ACC (overall prediction accuracy) value of 79.72% and MCC (Matthew's correlation coefficient) value of 59.77% for the independent testing dataset. Then mRMR (minimum redundancy maximum relevance) strategy is used, which is transferred into algorithm selection. In addition, the MCC value of each classifier is assigned to be the weight of the classifier's vote. As a result, best average MCC values are attained when 22 algorithms are selected and integrated through weighted votes, which are 64.70% for the independent testing dataset, and ACC value is 82.04% at this moment.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Computational Prediction of RNA-Binding Proteins and Binding Sites
    Si, Jingna
    Cui, Jing
    Cheng, Jin
    Wu, Rongling
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2015, 16 (11): : 26303 - 26317
  • [2] Refining the pool of RNA-binding domains advances the classification and prediction of RNA-binding proteins
    Wassmer, Elsa
    Koppany, Gergely
    Hermes, Malte
    Diederichs, Sven
    Caudron-Herger, Maiwen
    [J]. NUCLEIC ACIDS RESEARCH, 2024,
  • [3] PRBP: Prediction of RNA-Binding Proteins Using a Random Forest Algorithm Combined with an RNA-Binding Residue Predictor
    Ma, Xin
    Guo, Jing
    Xiao, Ke
    Sun, Xiao
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (06) : 1385 - 1393
  • [4] Prediction of dinucleotide-specific RNA-binding sites in proteins
    Fernandez, Michael
    Kumagai, Yutaro
    Standley, Daron M.
    Sarai, Akinori
    Mizuguchi, Kenji
    Ahmad, Shandar
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [5] RNA-binding proteins in RNA interference
    Kotelnikov R.N.
    Shpiz S.G.
    Kalmykova A.I.
    Gvozdev V.A.
    [J]. Molecular Biology, 2006, 40 (4) : 528 - 540
  • [6] Prediction of dinucleotide-specific RNA-binding sites in proteins
    Michael Fernandez
    Yutaro Kumagai
    Daron M Standley
    Akinori Sarai
    Kenji Mizuguchi
    Shandar Ahmad
    [J]. BMC Bioinformatics, 12
  • [7] Identification of RNA-Binding Proteins with Prognostic Prediction in Colorectal Cancer
    Bai, Xiao-fen
    Liu, Jing-wen
    [J]. BIOMED RESEARCH INTERNATIONAL, 2021, 2021
  • [8] RNA-Binding Proteins in Cardiomyopathies
    Shi, De-Li
    [J]. JOURNAL OF CARDIOVASCULAR DEVELOPMENT AND DISEASE, 2024, 11 (03)
  • [9] Chloroplast RNA-binding proteins
    Jörg Nickelsen
    [J]. Current Genetics, 2003, 43 : 392 - 399
  • [10] Chloroplast RNA-binding proteins
    Sugita, M
    Sugiura, M
    [J]. NIPPON NOGEIKAGAKU KAISHI-JOURNAL OF THE JAPAN SOCIETY FOR BIOSCIENCE BIOTECHNOLOGY AND AGROCHEMISTRY, 1997, 71 (11): : 1177 - 1179