gHPCSO: Gaussian Distribution Based Hybrid Particle Cat Swarm Optimization for Linear B-cell Epitope Prediction

被引:1
|
作者
Angaitkar P. [1 ]
Janghel R.R. [1 ]
Sahu T.P. [1 ]
机构
[1] Department of Information Technology, National Institute of Technology, Raipur, G.E. Road, C.G., Raipur
关键词
Antigen sequence; Cat Swarm Optimization (CSO); Feature extraction; Feature selection; Linear B-cell epitopes (LBCE); Particle Swarm Optimization (PSO); Protein sequence;
D O I
10.1007/s41870-023-01294-8
中图分类号
学科分类号
摘要
Linear B-cell epitope (LBCE) identification is critical in developing peptide-based vaccines, antibody production, and immuno-diagnosis. Laboratory experiments are costly and time-consuming for this endeavour. Therefore, it is required to develop computational techniques to predict LBCE. Many techniques have been developed, but none of them achieved the highest accuracy due to high-dimensional LBC data. High dimensional data leads to computational complexity, and the inclusion of all the features may not provide an accurate prediction. An effective feature selection method is required to select the most prominent features from the high dimensional dataset. This paper presents a novel feature selection method for LBCE classification which is named as Gaussian distribution-based Hybrid Particle Cat Swarm Optimization (gHPCSO). The gHPCSO solves the problem of local optima and low convergence of Particle Swarm Optimization (PSO) by using seeking and tracing mode of Cat Swarm Optimization (CSO) where particle position is updated through CSO. The Gaussian distribution is employed for the population initialization of particles to improve the convergence. The benchmark dataset for the experiments is collected from the IEDB protein bank (LBtope Fixed, bCPred, ABCPred16, and Chen). Different feature extraction techniques are used to create feature vectors. These extracted features are provided to gHPCSO which uses k-nearest neighbour (k-NN) to classify LBCE and non-epitopes. Precision, recall, F-measure, Mathews correlation coefficient (MCC), and accuracy are considered for evaluation purposes. State-of-the-art approaches are compared with gHPCSO where results justifies the superiority of gHPCSO. Friedman's test is used to evaluate the consistency of the gHPCSO. © 2023, The Author(s), under exclusive licence to Bharati Vidyapeeth's Institute of Computer Applications and Management.
引用
收藏
页码:2805 / 2818
页数:13
相关论文
共 50 条
  • [1] B-cell Epitope Prediction Using Extreme Learning Machine and Particle Swarm Optimization-based Undersampling
    Jadid, Maral Arvanaghi
    Habibi, Mahnaz
    [J]. 2017 ARTIFICIAL INTELLIGENCE AND ROBOTICS (IRANOPEN), 2017, : 33 - 38
  • [2] BeeTLe: A Framework for Linear B-Cell Epitope Prediction and Classification
    Yuan, Xiao
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI, 2023, 14174 : 477 - 494
  • [3] A Hybrid Global Optimization Algorithm Based on Particle Swarm Optimization and Gaussian Process
    Zhang, Yan
    Li, Hongyu
    Bao, Enhe
    Zhang, Lu
    Yu, Aiping
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2019, 12 (02) : 1270 - 1281
  • [4] A Hybrid Global Optimization Algorithm Based on Particle Swarm Optimization and Gaussian Process
    Yan Zhang
    Hongyu Li
    Enhe Bao
    Lu Zhang
    Aiping Yu
    [J]. International Journal of Computational Intelligence Systems, 2019, 12 : 1270 - 1281
  • [5] EpiDope: a deep neural network for linear B-cell epitope prediction
    Collatz, Maximilian
    Mock, Florian
    Barth, Emanuel
    Hoelzer, Martin
    Sachse, Konrad
    Marz, Manja
    [J]. BIOINFORMATICS, 2021, 37 (04) : 448 - 455
  • [6] Deep learning methods improve linear B-cell epitope prediction
    Liu, Tao
    Shi, Kaiwen
    Li, Wuju
    [J]. BIODATA MINING, 2020, 13 (01)
  • [7] Deep learning methods improve linear B-cell epitope prediction
    Tao Liu
    Kaiwen Shi
    Wuju Li
    [J]. BioData Mining, 13
  • [8] EPMLR: sequence-based linear B-cell epitope prediction method using multiple linear regression
    Lian, Yao
    Ge, Meng
    Pan, Xian-Ming
    [J]. BMC BIOINFORMATICS, 2014, 15
  • [9] EPMLR: sequence-based linear B-cell epitope prediction method using multiple linear regression
    Yao Lian
    Meng Ge
    Xian-Ming Pan
    [J]. BMC Bioinformatics, 15
  • [10] Fault diagnosis of nodes in WSN based on particle swarm optimization and gaussian distribution
    [J]. Yu, C. (yuchengbo@cqut.edu.cn), 2013, Nanjing University of Aeronautics an Astronautics (33):