GIpred: a computational tool for prediction of GIGANTEA proteins using machine learning algorithm

被引:0
|
作者
Meher, Prabina Kumar [1 ,3 ]
Dash, Sagarika [2 ]
Sahu, Tanmaya Kumar [1 ]
Satpathy, Subhrajit [1 ]
Pradhan, Sukanta Kumar [2 ]
机构
[1] ICAR Indian Agr Stat Res Inst, New Delhi, India
[2] Orissa Univ Agr & Technol, Bhubaneswar, Odisha, India
[3] ICAR IASRI, Div Stat Genet, New Delhi 12, India
关键词
Circadian gene; Computational biology; Machine learning; Support vector machine; Proteome; F-BOX PROTEINS; SECONDARY STRUCTURE; CIRCADIAN CLOCK; WEB SERVER; ARABIDOPSIS; DOMAIN; GENE; EXPRESSION; IDENTIFICATION; DIMERIZATION;
D O I
10.1007/s12298-022-01130-6
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
In plants, GIGANTEA (GI) protein plays different biological functions including carbon and sucrose metabolism, cell wall deposition, transpiration and hypocotyl elongation. This suggests that GI is an important class of proteins. So far, the resource-intensive experimental methods have been mostly utilized for identification of GI proteins. Thus, we made an attempt in this study to develop a computational model for fast and accurate prediction of GI proteins. Ten different supervised learning algorithms i.e., SVM, RF, JRIP, J48, LMT, IBK, NB, PART, BAGG and LGB were employed for prediction, where the amino acid composition (AAC), FASGAI features and physicochemical (PHYC) properties were used as numerical inputs for the learning algorithms. Higher accuracies i.e., 96.75% of AUC-ROC and 86.7% of AUC-PR were observed for SVM coupled with AAC + PHYC feature combination, while evaluated with five-fold cross validation. With leave-one-out cross validation, 97.29% of AUC-ROC and 87.89% of AUC-PR were respectively achieved. While the performance of the model was evaluated with an independent dataset of 18 GI sequences, 17 were observed as correctly predicted. We have also performed proteome-wide identification of GI proteins in wheat, followed by functional annotation using Gene Ontology terms. A prediction server "GIpred'' is freely accessible at http://cabgrid.res.in:8080/gipred/ for proteome-wide recognition of GI proteins.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [31] Machine Learning Based Malaria Prediction Using KNN Algorithm
    Meenu, M.
    Subhasri, G.
    Mahima, R.
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (02): : 38 - 42
  • [32] Human Age and Gender Prediction using Machine Learning Algorithm
    Hiremath, Jayaprada S.
    Patil, Shantakumar B.
    Patil, Premjyoti S.
    2021 IEEE INTERNATIONAL CONFERENCE ON MOBILE NETWORKS AND WIRELESS COMMUNICATIONS (ICMNWC), 2021,
  • [33] Prediction of FBC Boiler Efficiency using Machine Learning Algorithm
    Pawar, Raunak
    Khandekar, M. A.
    Agashe, S. D.
    Makwana, Pratap
    2022 IEEE INTERNATIONAL POWER AND RENEWABLE ENERGY CONFERENCE, IPRECON, 2022,
  • [34] Visiting Time Prediction Using Machine Learning Regression Algorithm
    Hapsari, Indri
    Surjandari, Isti
    Komarudin
    2018 6TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT), 2018, : 495 - 500
  • [35] Prediction of Sulfur Content in Copra Using Machine Learning Algorithm
    Sagayaraj, A. S.
    Devi, T. K.
    Umadevi, S.
    APPLIED ARTIFICIAL INTELLIGENCE, 2021, 35 (15) : 2228 - 2245
  • [36] Blood Glucose Diabetic Prediction using Machine Learning Algorithm
    Devi, A.
    Sasireka, P.
    Kovardhani, K.
    Premalatha, G.
    Sivasakthi, T.
    Subash, R.
    2ND INTERNATIONAL CONFERENCE ON SUSTAINABLE COMPUTING AND SMART SYSTEMS, ICSCSS 2024, 2024, : 1519 - 1522
  • [37] Japanese Encephalitis Symptom Prediction Using Machine Learning Algorithm
    Ranjan, Piyush
    Mishra, Sushruta
    Swain, Tridiv
    Sahoo, Kshira Sagar
    Lecture Notes in Networks and Systems, 2024, 728 LNNS : 99 - 112
  • [38] A forecasting tool for prediction of epileptic seizures using a machine learning approach
    Asharindavida, Fayas
    Hossain, M. Shamim
    Thacham, Azeemsha
    Khammari, Hedi
    Ahmed, Irfan
    Alraddady, Fahad
    Masud, Mehedi
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (01):
  • [39] Prediction of liquid–liquid phase separating proteins using machine learning
    Xiaoquan Chu
    Tanlin Sun
    Qian Li
    Youjun Xu
    Zhuqing Zhang
    Luhua Lai
    Jianfeng Pei
    BMC Bioinformatics, 23
  • [40] APOPLASTP: prediction of effectors and plant proteins in the apoplast using machine learning
    Sperschneider, Jana
    Dodds, Peter N.
    Singh, Karam B.
    Taylor, Jennifer M.
    NEW PHYTOLOGIST, 2018, 217 (04) : 1764 - 1778