Multi-label classifier for protein sequence using heuristic-based deep convolution neural network

被引:0
|
作者
Vikas Chauhan
Aruna Tiwari
Niranjan Joshi
Sahaj Khandelwal
机构
[1] Indian Institute of Technology,Department of Computer Science and Engineering
来源
Applied Intelligence | 2022年 / 52卷
关键词
Multi-label classification; Genomics; Heuristic; Primary protein structure;
D O I
暂无
中图分类号
学科分类号
摘要
Deep learning techniques are found very useful to classify sequential data in recent times. The protein sequences belong to the functional classes based on the structure of their sequences. The annotation task of protein sequences into corresponding functional classes is multi-label in nature. The primary structure of protein contains a notable amount of vast data compared to the other secondary, tertiary, and quaternary structures. The clustering-based techniques require expert domain knowledge from the extensive data samples. Traditional methods use the n-gram features of amino acids while ignoring the relationship of motifs and amino acid sequence. This paper proposes an efficient method to classify the proteins into their functional classes using a convolution neural network based on heuristic rules. The proposed approach works on the primary structure of protein sequences which considers the relationship among motifs and amino acids. The proposed approach also takes into account the amino acid locations in the protein sequence. The proposed approach considers the affinity information between amino acids and motifs. Along with achieving high performance in the classification of protein sequences, we propose a heuristic approach to improve the precision and recall of the individual functional classes. The proposed heuristic approach improves the performance and handles the data imbalance problem. The proposed approach is compared with other competitive approaches, and our approach provides better performance metrics in terms of precision, recall, AUC, and subset accuracy. The greatest challenge with multi-label classification is to handle the data imbalance, which appears due to variance in frequencies of the labels in the data. This data imbalance is dealt with weight modulation in the loss function to influence the learning process.
引用
收藏
页码:2820 / 2837
页数:17
相关论文
共 50 条
  • [41] Near perfect perfect protein multi-label classification with deep neural networks
    Szalkai, Balazs
    Grolmusz, Vince
    [J]. METHODS, 2018, 132 : 50 - 56
  • [42] A label distance maximum-based classifier for multi-label learning
    Liu, Xiaoli
    Bao, Hang
    Zhao, Dazhe
    Cao, Peng
    [J]. BIO-MEDICAL MATERIALS AND ENGINEERING, 2015, 26 : S1969 - S1976
  • [43] Multi-label classification of reduced-lead ECGs using an interpretable deep convolutional neural network
    Wickramasinghe, Nima L.
    Athif, Mohamed
    [J]. PHYSIOLOGICAL MEASUREMENT, 2022, 43 (06)
  • [44] Pairnorm based Graphical Convolution Network for zero-shot multi-label classification
    Chauhan, Vikas
    Tiwari, Aruna
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 114
  • [45] Mixture Gases Classification Based on Multi-Label One-Dimensional Deep Convolutional Neural Network
    Zhao, Xiaojin
    Wen, Zhihuang
    Pan, Xiaofang
    Ye, Wenbin
    Bermak, Amine
    [J]. IEEE ACCESS, 2019, 7 : 12630 - 12637
  • [46] Multi-label convolutional neural network based pedestrian attribute classification
    Zhu, Jianqing
    Liao, Shengcai
    Lei, Zhen
    Li, Stan Z.
    [J]. IMAGE AND VISION COMPUTING, 2017, 58 : 224 - 229
  • [47] LaCova: A Tree-Based Multi-Label Classifier using Label Covariance as Splitting Criterion
    Al-Otaibi, Reem
    Kull, Meelis
    Flach, Peter
    [J]. 2014 13TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2014, : 74 - 79
  • [48] A novel bayesian network-based ensemble classifier chains for multi-label classification
    Wang, Zhenwu
    Zhang, Shiqi
    Chen, Yang
    Han, Mengjie
    Zhou, Yang
    Wan, Benting
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (05) : 7373 - 7399
  • [49] Multi-label Text Classification with Deep Neural Networks
    Chen, Yun
    Xiao, Bo
    Lin, Zhiqing
    Dai, Cheng
    Li, Zuochao
    Yang, Liping
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 409 - 413
  • [50] Deep concatenated features with improved heuristic-based recurrent neural network for hyperspectral image classification
    Marri Venkata Dasu
    P. Veera Narayana Reddy
    S. Chandra Mohan Reddy
    [J]. Multimedia Tools and Applications, 2024, 83 : 49875 - 49904