GIpred: a computational tool for prediction of GIGANTEA proteins using machine learning algorithm

被引:0
|
作者
Meher, Prabina Kumar [1 ,3 ]
Dash, Sagarika [2 ]
Sahu, Tanmaya Kumar [1 ]
Satpathy, Subhrajit [1 ]
Pradhan, Sukanta Kumar [2 ]
机构
[1] ICAR Indian Agr Stat Res Inst, New Delhi, India
[2] Orissa Univ Agr & Technol, Bhubaneswar, Odisha, India
[3] ICAR IASRI, Div Stat Genet, New Delhi 12, India
关键词
Circadian gene; Computational biology; Machine learning; Support vector machine; Proteome; F-BOX PROTEINS; SECONDARY STRUCTURE; CIRCADIAN CLOCK; WEB SERVER; ARABIDOPSIS; DOMAIN; GENE; EXPRESSION; IDENTIFICATION; DIMERIZATION;
D O I
10.1007/s12298-022-01130-6
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
In plants, GIGANTEA (GI) protein plays different biological functions including carbon and sucrose metabolism, cell wall deposition, transpiration and hypocotyl elongation. This suggests that GI is an important class of proteins. So far, the resource-intensive experimental methods have been mostly utilized for identification of GI proteins. Thus, we made an attempt in this study to develop a computational model for fast and accurate prediction of GI proteins. Ten different supervised learning algorithms i.e., SVM, RF, JRIP, J48, LMT, IBK, NB, PART, BAGG and LGB were employed for prediction, where the amino acid composition (AAC), FASGAI features and physicochemical (PHYC) properties were used as numerical inputs for the learning algorithms. Higher accuracies i.e., 96.75% of AUC-ROC and 86.7% of AUC-PR were observed for SVM coupled with AAC + PHYC feature combination, while evaluated with five-fold cross validation. With leave-one-out cross validation, 97.29% of AUC-ROC and 87.89% of AUC-PR were respectively achieved. While the performance of the model was evaluated with an independent dataset of 18 GI sequences, 17 were observed as correctly predicted. We have also performed proteome-wide identification of GI proteins in wheat, followed by functional annotation using Gene Ontology terms. A prediction server "GIpred'' is freely accessible at http://cabgrid.res.in:8080/gipred/ for proteome-wide recognition of GI proteins.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [21] Prediction of Phage Virion Proteins Using Machine Learning Methods
    Barman, Ranjan Kumar
    Chakrabarti, Alok Kumar
    Dutta, Shanta
    MOLECULES, 2023, 28 (05):
  • [22] Research on tool remaining useful life prediction algorithm based on machine learning
    Ge, Yong
    Teo, Hiu Hong
    Moey, Lip Kean
    Tayier, Walisijiang
    ENGINEERING RESEARCH EXPRESS, 2024, 6 (03):
  • [23] Development and validation of a machine learning algorithm prediction for dense granule proteins in Apicomplexa
    Lu, Zhenxiao
    Hu, Hang
    Song, Yashan
    Zhou, Siyi
    Ayanniyi, Olalekan Opeyemi
    Xu, Qianming
    Yue, Zhenyu
    Yang, Congshan
    PARASITES & VECTORS, 2023, 16 (01)
  • [24] Development and validation of a machine learning algorithm prediction for dense granule proteins in Apicomplexa
    Zhenxiao Lu
    Hang Hu
    Yashan Song
    Siyi Zhou
    Olalekan Opeyemi Ayanniyi
    Qianming Xu
    Zhenyu Yue
    Congshan Yang
    Parasites & Vectors, 16
  • [25] Computational prediction of RNA tertiary structures using machine learning methods*
    Huang, Bin
    Du, Yuanyang
    Zhang, Shuai
    Li, Wenfei
    Wang, Jun
    Zhang, Jian
    CHINESE PHYSICS B, 2020, 29 (10)
  • [26] Computational Prediction of lncRNA-Protein Interactions using Machine learning
    Mushtaq, Muhammad
    Naveed, Hammad
    Khalid, Zoya
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 2100 - 2103
  • [27] Computational prediction of RNA tertiary structures using machine learning methods
    黄斌
    杜渊洋
    张帅
    李文飞
    王骏
    张建
    Chinese Physics B, 2020, 29 (10) : 31 - 37
  • [28] Heart Disease Prediction Using Modified Machine Learning Algorithm
    Kaur, Bavneet
    Kaur, Gaganpreet
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 1, 2023, 473 : 189 - 201
  • [29] Using Machine Learning Algorithm as a Method for Improving Stroke Prediction
    Alageel, Nojood
    Alharbi, Rahaf
    Alharbi, Rehab
    Alsayil, Maryam
    Tabuk, Lubna A. Alharbi
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (04) : 738 - 744
  • [30] A framework for prediction of extrusion responses using machine learning algorithm
    Manohar, Grandhi
    Francy, K. Anupama
    Rao, Ch. Srinivasa
    INTERNATIONAL JOURNAL OF INTERACTIVE DESIGN AND MANUFACTURING - IJIDEM, 2024,