A Comprehensive Review on Machine Learning Techniques for Protein Family Prediction

被引:0
|
作者
Idhaya, T. [1 ]
Suruliandi, A. [1 ]
Raja, S. P. [2 ]
机构
[1] Manonmaniam Sundaranar Univ, Dept Comp Sci & Engn, Tirunelveli, Tamilnadu, India
[2] Vellore Inst Technol, Sch Comp Sci & Engn, Vellore, Tamilnadu, India
来源
PROTEIN JOURNAL | 2024年 / 43卷 / 02期
关键词
Proteomics; Protein family; Machine learning; Sequence-homology; Alignment; CLASSIFICATION;
D O I
10.1007/s10930-024-10181-5
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Proteomics is a field dedicated to the analysis of proteins in cells, tissues, and organisms, aiming to gain insights into their structures, functions, and interactions. A crucial aspect within proteomics is protein family prediction, which involves identifying evolutionary relationships between proteins by examining similarities in their sequences or structures. This approach holds great potential for applications such as drug discovery and functional annotation of genomes. However, current methods for protein family prediction have certain limitations, including limited accuracy, high false positive rates, and challenges in handling large datasets. Some methods also rely on homologous sequences or protein structures, which introduce biases and restrict their applicability to specific protein families or structures. To overcome these limitations, researchers have turned to machine learning (ML) approaches that can identify connections between protein features and simplify complex high-dimensional datasets. This paper presents a comprehensive survey of articles that employ various ML techniques for predicting protein families. The primary objective is to explore and improve ML techniques specifically for protein family prediction, thus advancing future research in the field. Through qualitative and quantitative analyses of ML techniques, it is evident that multiple methods utilizing a range of classifiers have been applied for protein family prediction. However, there has been limited focus on developing novel classifiers for protein family classification, highlighting the urgent need for improved approaches in this area. By addressing these challenges, this research aims to enhance the accuracy and effectiveness of protein family prediction, ultimately facilitating advancements in proteomics and its diverse applications.
引用
收藏
页码:171 / 186
页数:16
相关论文
共 50 条
  • [1] A Comprehensive Review on Machine Learning Techniques for Protein Family Prediction
    T. Idhaya
    A. Suruliandi
    S. P. Raja
    [J]. The Protein Journal, 2024, 43 : 171 - 186
  • [2] A Comprehensive Review on Crop Disease Prediction Based on Machine Learning and Deep Learning Techniques
    Patil, Manoj A.
    Manohar, M.
    [J]. THIRD CONGRESS ON INTELLIGENT SYSTEMS, CIS 2022, VOL 1, 2023, 608 : 481 - 503
  • [3] Machine learning techniques for prediction of capacitance and remaining useful life of supercapacitors: A comprehensive review
    Vaishali Sawant
    Rashmi Deshmukh
    Chetan Awati
    [J]. Journal of Energy Chemistry, 2023, 77 (02) : 438 - 451
  • [4] Machine learning techniques for prediction of capacitance and remaining useful life of supercapacitors: A comprehensive review
    Sawant, Vaishali
    Deshmukh, Rashmi
    Awati, Chetan
    [J]. JOURNAL OF ENERGY CHEMISTRY, 2023, 77 : 438 - 451
  • [5] Machine learning techniques for protein function prediction
    Bonetta, Rosalin
    Valentino, Gianluca
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2020, 88 (03) : 397 - 413
  • [6] A comprehensive review of machine learning techniques on diabetes detection
    Toshita Sharma
    Manan Shah
    [J]. Visual Computing for Industry, Biomedicine, and Art, 4
  • [7] A comprehensive review of model compression techniques in machine learning
    Dantas, Pierre Vilar
    da Silva Jr, Waldir Sabino
    Cordeiro, Lucas Carvalho
    Carvalho, Celso Barbosa
    [J]. APPLIED INTELLIGENCE, 2024, 54 (22) : 11804 - 11844
  • [8] A comprehensive review of machine learning techniques on diabetes detection
    Sharma, Toshita
    Shah, Manan
    [J]. VISUAL COMPUTING FOR INDUSTRY BIOMEDICINE AND ART, 2021, 4 (01)
  • [9] Comprehensive Study on Machine Learning Techniques for Software Bug Prediction
    Khleel, Nasraldeen Alnor Adam
    Nehez, Karoly
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (08) : 726 - 735
  • [10] Calibrating the classifier for protein family prediction with protein sequence using machine learning techniques: An empirical investigation
    Idhaya, T.
    Suruliandi, A.
    Calitoiu, Dragos
    Raja, S. P.
    [J]. INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2023, 21 (03)