Photosynthetic protein classification using genome neighborhood-based machine learning feature

被引:0
|
作者
Apiwat Sangphukieo
Teeraphan Laomettachit
Marasri Ruengjitchatchawalya
机构
[1] King Mongkut’s University of Technology Thonburi (KMUTT),Bioinformatics and Systems Biology Program, School of Bioresources and Technology
[2] KMUTT,Biotechnology program, School of Bioresources and Technology
[3] KMUTT,School of Information Technology
[4] Bang Mod,Algal Biotechnology Research Group
[5] Pilot Plant Development and Training Institute (PDTI),undefined
[6] KMUTT,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Identification of novel photosynthetic proteins is important for understanding and improving photosynthetic efficiency. Synergistically, genome neighborhood can provide additional useful information to identify photosynthetic proteins. We, therefore, expected that applying a computational approach, particularly machine learning (ML) with the genome neighborhood-based feature should facilitate the photosynthetic function assignment. Our results revealed a functional relationship between photosynthetic genes and their conserved neighboring genes observed by ‘Phylo score’, indicating their functions could be inferred from the genome neighborhood profile. Therefore, we created a new method for extracting patterns based on the genome neighborhood network (GNN) and applied them for the photosynthetic protein classification using ML algorithms. Random forest (RF) classifier using genome neighborhood-based features achieved the highest accuracy up to 87% in the classification of photosynthetic proteins and also showed better performance (Mathew’s correlation coefficient = 0.718) than other available tools including the sequence similarity search (0.447) and ML-based method (0.361). Furthermore, we demonstrated the ability of our model to identify novel photosynthetic proteins compared to the other methods. Our classifier is available at http://bicep2.kmutt.ac.th/photomod_standalone, https://bit.ly/2S0I2Ox and DockerHub: https://hub.docker.com/r/asangphukieo/photomod.
引用
下载
收藏
相关论文
共 50 条
  • [41] Machine-Learning-Based Hotspot Detection Using Topological Classification and Critical Feature Extraction
    Yu, Yen-Ting
    Lin, Geng-He
    Jiang, Iris Hui-Ru
    Chiang, Charles
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2015, 34 (03) : 460 - 470
  • [42] Water agricultural management based on hydrology using machine learning techniques for feature extraction and classification
    Yi-Chia Lin
    Almuhannad Sulaiman Alorfi
    Tawfiq Hasanin
    Mahendran Arumugam
    Roobaea Alroobaea
    Majed Alsafyani
    Wael Y. Alghamdi
    Acta Geophysica, 2024, 72 : 1945 - 1955
  • [43] Heart Diseases Prediction for Optimization based Feature Selection and Classification using Machine Learning Methods
    Rajinikanth, N.
    Pavithra, L.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (02) : 636 - 643
  • [44] Machine-Learning-Based Hotspot Detection Using Topological Classification and Critical Feature Extraction
    Yu, Yen-Ting
    Lin, Geng-He
    Jiang, Iris Hui-Ru
    Chiang, Charles
    2013 50TH ACM / EDAC / IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2013,
  • [45] Regularization based discriminative feature pattern selection for the classification of Parkinson cases using machine learning
    Kaliyan, Kamalakannan
    Ganesan, Anandharaj
    BIO-ALGORITHMS AND MED-SYSTEMS, 2021, 17 (03) : 181 - 189
  • [46] The Classification of Medicinal Plant Leaves Based on Multispectral and Texture Feature Using Machine Learning Approach
    Naeem, Samreen
    Ali, Aqib
    Chesneau, Christophe
    Tahir, Muhammad H.
    Jamal, Farrukh
    Sherwani, Rehan Ahmad Khan
    Ul Hassan, Mahmood
    AGRONOMY-BASEL, 2021, 11 (02):
  • [47] Water agricultural management based on hydrology using machine learning techniques for feature extraction and classification
    Lin, Yi-Chia
    Alorfi, Almuhannad Sulaiman
    Hasanin, Tawfiq
    Arumugam, Mahendran
    Alroobaea, Roobaea
    Alsafyani, Majed
    Alghamdi, Wael Y.
    ACTA GEOPHYSICA, 2024, 72 (03) : 1945 - 1955
  • [48] Classification of lung cancer using ensemble-based feature selection and machine learning methods
    Cai, Zhihua
    Xu, Dong
    Zhang, Qing
    Zhang, Jiexia
    Ngai, Sai-Ming
    Shao, Jianlin
    MOLECULAR BIOSYSTEMS, 2015, 11 (03) : 791 - 800
  • [49] Image feature-based electric vehicle detection and classification system using machine learning
    Kim S.
    Kang S.-J.
    Kang, Suk-Ju (sjkang@sogang.ac.kr), 1600, Korean Institute of Electrical Engineers (66): : 1092 - 1099
  • [50] Facial geometric feature extraction based emotional expression classification using machine learning algorithms
    Murugappan, M.
    Mutawa, A.
    PLOS ONE, 2021, 16 (02):