Photosynthetic protein classification using genome neighborhood-based machine learning feature

被引:0
|
作者
Apiwat Sangphukieo
Teeraphan Laomettachit
Marasri Ruengjitchatchawalya
机构
[1] King Mongkut’s University of Technology Thonburi (KMUTT),Bioinformatics and Systems Biology Program, School of Bioresources and Technology
[2] KMUTT,Biotechnology program, School of Bioresources and Technology
[3] KMUTT,School of Information Technology
[4] Bang Mod,Algal Biotechnology Research Group
[5] Pilot Plant Development and Training Institute (PDTI),undefined
[6] KMUTT,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Identification of novel photosynthetic proteins is important for understanding and improving photosynthetic efficiency. Synergistically, genome neighborhood can provide additional useful information to identify photosynthetic proteins. We, therefore, expected that applying a computational approach, particularly machine learning (ML) with the genome neighborhood-based feature should facilitate the photosynthetic function assignment. Our results revealed a functional relationship between photosynthetic genes and their conserved neighboring genes observed by ‘Phylo score’, indicating their functions could be inferred from the genome neighborhood profile. Therefore, we created a new method for extracting patterns based on the genome neighborhood network (GNN) and applied them for the photosynthetic protein classification using ML algorithms. Random forest (RF) classifier using genome neighborhood-based features achieved the highest accuracy up to 87% in the classification of photosynthetic proteins and also showed better performance (Mathew’s correlation coefficient = 0.718) than other available tools including the sequence similarity search (0.447) and ML-based method (0.361). Furthermore, we demonstrated the ability of our model to identify novel photosynthetic proteins compared to the other methods. Our classifier is available at http://bicep2.kmutt.ac.th/photomod_standalone, https://bit.ly/2S0I2Ox and DockerHub: https://hub.docker.com/r/asangphukieo/photomod.
引用
收藏
相关论文
共 50 条
  • [31] A study of selective neighborhood-based naive Bayes for efficient lazy learning
    Xie, ZP
    Zhang, Q
    ICTAI 2004: 16TH IEEE INTERNATIONALCONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, : 758 - 760
  • [32] Perceptual tolerance neighborhood-based similarity in content-based image retrieval and classification
    Meghdadi, Amir H.
    Peters, James F.
    INTERNATIONAL JOURNAL OF INTELLIGENT COMPUTING AND CYBERNETICS, 2012, 5 (02) : 164 - 185
  • [33] NCRL: Neighborhood-Based Collaborative Residual Learning for Adaptive QoS Prediction
    Zou, Guobing
    Wu, Shaogang
    Hu, Shengxiang
    Cao, Chenhong
    Gan, Yanglan
    Zhang, Bofeng
    Chen, Yixin
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (03) : 2030 - 2043
  • [34] Surrounding neighborhood-based SMOTE for learning from imbalanced data sets
    García, V.
    Sánchez, J.S.
    Martín-Félez, R.
    Mollineda, R.A.
    Progress in Artificial Intelligence, 2012, 1 (04) : 347 - 362
  • [35] Neighborhood-based inference and restricted Boltzmann machine for microbe and drug associations prediction
    Cheng, Xiaolong
    Qu, Jia
    Song, Shuangbao
    Bian, Zekang
    PEERJ, 2022, 10
  • [36] TSFNFR: Two-stage fuzzy neighborhood-based feature reduction with binary whale optimization algorithm for imbalanced data classification
    Sun, Lin
    Wang, Xinya
    Ding, Weiping
    Xu, Jiucheng
    KNOWLEDGE-BASED SYSTEMS, 2022, 256
  • [37] Neighborhood-based Collaborative Filtering Using Grey Relational Analysis
    Hu, Yi-Chung
    JOURNAL OF GREY SYSTEM, 2014, 26 (01): : 99 - 114
  • [38] Protein sequence classification using extreme learning machine
    Wang, DH
    Huang, GB
    Proceedings of the International Joint Conference on Neural Networks (IJCNN), Vols 1-5, 2005, : 1406 - 1411
  • [39] Machine learning for multi-class protein fold classification based on neural networks with feature gating
    Huang, CD
    Chung, IF
    Pal, NR
    Lin, CT
    ARTIFICIAL NEURAL NETWORKS AND NEURAL INFORMATION PROCESSING - ICAN/ICONIP 2003, 2003, 2714 : 1168 - 1175
  • [40] A machine learning-based feature extraction method for image classification using ResNet architecture
    Liao, Jing
    Guo, Linpei
    Jiang, Lei
    Yu, Chang
    Liang, Wei
    Li, Kuanching
    Pop, Florin
    Digital Signal Processing: A Review Journal, 2025, 160