An Efficient Computational Intelligence Technique for Classification of Protein Sequences

被引:0
|
作者
Iqbal, Muhammad Javed [1 ]
Faye, Ibrahima [2 ]
Said, Abas Md [1 ]
Samir, Brahim Belhaouari [3 ]
机构
[1] Univ Teknol PETRONAS, Dept Comp & Informat Sci, Tronoh, Malaysia
[2] Univ Teknol PETRONAS, Dept Fundamental & Appl Sci, Tronoh, Malaysia
[3] Alfaisal Univ, Coll Sci, Riyadh, Saudi Arabia
关键词
Bioinformatics; Feature encoding; Data mining; Superfamily; Protein classification;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many artificial intelligence techniques have been developed to process the constantly increasing volume of data to extract meaningful information from it. The accurate annotation of the unknown protein using the classification of the protein sequence into an existing superfamily is considered a critical and challenging task in bioinformatics and computational biology. This classification would be helpful in the analysis and modeling of unknown protein to determine their structure and function. In this paper, a frequency-based feature encoding technique has been used in the proposed framework to represent amino acids of a protein's primary sequence. The technique has considered the occurrence frequency of each amino acid in a sequence. Popular classification algorithms such as decision tree, naive Bayes, neural network, random forest and support vector machine have been employed to evaluate the effectiveness of the encoding method utilized in the proposed framework. Results have indicated that the decision tree classifier significantly shows better results in terms of classification accuracy, specificity, sensitivity, F-measure, etc. The classification accuracy of 88.7% was achieved over the Yeast protein sequence data taken from the well-known UniProtKB database.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Computational identification of MoRFs in protein sequences
    Malhis, Nawar
    Gsponer, Joerg
    BIOINFORMATICS, 2015, 31 (11) : 1738 - 1744
  • [22] Computational Intelligence Tools for Protein Modeling
    Kondabala, Rajesh
    Kumar, Vijay
    HARMONY SEARCH AND NATURE INSPIRED OPTIMIZATION ALGORITHMS, 2019, 741 : 949 - 956
  • [23] Hybrid Computational Intelligence Technique: Eczema Detection
    Arora, Yash Kumar
    Tandon, Amish
    Nijhawan, Rahul
    PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 2472 - 2474
  • [24] Computational intelligence techniques for efficient delivery of healthcare
    Singh, Brijendra
    Acharjya, D. P.
    HEALTH AND TECHNOLOGY, 2020, 10 (01) : 167 - 185
  • [25] Computational intelligence techniques for efficient delivery of healthcare
    Brijendra Singh
    D. P. Acharjya
    Health and Technology, 2020, 10 : 167 - 185
  • [26] AN EFFICIENT TECHNIQUE FOR LITHOLOGY CLASSIFICATION
    ELSHEIKH, TS
    SYIAM, MM
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 1989, 27 (05): : 629 - 632
  • [27] Computational intelligence - A broad initiative in automated learning from sequences
    Yang, Mary Qu
    Yang, Jack Y.
    Ersoy, Okan K.
    2005 ICSC CONGRESS ON COMPUTATIONAL INTELLIGENCE METHODS AND APPLICATIONS (CIMA 2005), 2005, : 153 - 158
  • [28] Efficient computational technique for virtual partitioning
    Bziuk, W.
    IET CIRCUITS DEVICES & SYSTEMS, 2008, 2 (01) : 39 - 49
  • [29] Incremental learning for classification of protein sequences
    Mohamed, Shakir
    Rubin, David
    Marwala, Tshilidzi
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 19 - 24