Multi-label classifier for protein sequence using heuristic-based deep convolution neural network

被引:0
|
作者
Vikas Chauhan
Aruna Tiwari
Niranjan Joshi
Sahaj Khandelwal
机构
[1] Indian Institute of Technology,Department of Computer Science and Engineering
来源
Applied Intelligence | 2022年 / 52卷
关键词
Multi-label classification; Genomics; Heuristic; Primary protein structure;
D O I
暂无
中图分类号
学科分类号
摘要
Deep learning techniques are found very useful to classify sequential data in recent times. The protein sequences belong to the functional classes based on the structure of their sequences. The annotation task of protein sequences into corresponding functional classes is multi-label in nature. The primary structure of protein contains a notable amount of vast data compared to the other secondary, tertiary, and quaternary structures. The clustering-based techniques require expert domain knowledge from the extensive data samples. Traditional methods use the n-gram features of amino acids while ignoring the relationship of motifs and amino acid sequence. This paper proposes an efficient method to classify the proteins into their functional classes using a convolution neural network based on heuristic rules. The proposed approach works on the primary structure of protein sequences which considers the relationship among motifs and amino acids. The proposed approach also takes into account the amino acid locations in the protein sequence. The proposed approach considers the affinity information between amino acids and motifs. Along with achieving high performance in the classification of protein sequences, we propose a heuristic approach to improve the precision and recall of the individual functional classes. The proposed heuristic approach improves the performance and handles the data imbalance problem. The proposed approach is compared with other competitive approaches, and our approach provides better performance metrics in terms of precision, recall, AUC, and subset accuracy. The greatest challenge with multi-label classification is to handle the data imbalance, which appears due to variance in frequencies of the labels in the data. This data imbalance is dealt with weight modulation in the loss function to influence the learning process.
引用
收藏
页码:2820 / 2837
页数:17
相关论文
共 50 条
  • [1] Multi-label classifier for protein sequence using heuristic-based deep convolution neural network
    Chauhan, Vikas
    Tiwari, Aruna
    Joshi, Niranjan
    Khandelwal, Sahaj
    [J]. APPLIED INTELLIGENCE, 2022, 52 (03) : 2820 - 2837
  • [2] A Deep Neural Network Based Hierarchical Multi-Label Classifier for Protein Function Prediction
    Yuan, Xin
    Li, Weite
    Lin, Kui
    Hu, Jinglu
    [J]. PROCEEDING OF THE 2019 INTERNATIONAL CONFERENCE ON COMPUTER, INFORMATION AND TELECOMMUNICATION SYSTEMS (IEEE CITS 2019), 2019, : 131 - 135
  • [3] A Neural Network-Based Multi-Label Classifier for Protein Function Prediction
    Tahzeeb, Shahab
    Hasan, Shehzad
    [J]. ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2022, 12 (01) : 7974 - 7981
  • [4] Deep Convolution Neural Network sharing for the multi-label images classification
    Coulibaly, Solemane
    Kamsu-Foguem, Bernard
    Kamissoko, Dantouma
    Traore, Daouda
    [J]. Machine Learning with Applications, 2022, 10
  • [5] Multi-Label Classification of Microblogging Texts Using Convolution Neural Network
    Parwez, Md Aslam
    Abulaish, Muhammad
    Jahiruddin
    [J]. IEEE ACCESS, 2019, 7 : 68678 - 68691
  • [6] Multi-Label Classification using Deep Convolutional Neural Network
    Lydia, A. Agnes
    Francis, E. Sagayaraj
    [J]. 2020 INTERNATIONAL CONFERENCE ON INNOVATIVE TRENDS IN INFORMATION TECHNOLOGY (ICITIIT), 2020,
  • [7] A deep neural network based hierarchical multi-label classification method
    Feng, Shou
    Zhao, Chunhui
    Fu, Ping
    [J]. REVIEW OF SCIENTIFIC INSTRUMENTS, 2020, 91 (02):
  • [8] Convolution Neural Network Based Multi-Label Disease Detection Using Smartphone Captured Tongue Images
    Bhatnagar, Vibha
    Bansod, Prashant P.
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (10):
  • [9] Multi-label classification of technical articles based on deep neural network
    Zhao, Qiuhan
    Yang, Wenchuan
    [J]. PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8391 - 8397
  • [10] A Deep Neural Network-Based Multi-Label Classifier for SLA Violation Prediction in a Latency Sensitive NFV Application
    Jalodia, Nikita
    Taneja, Mohit
    Davy, Alan
    [J]. IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2021, 2 : 2469 - 2493