Prediction of protein subcellular localization using machine learning with novel use of generic feature set

被引:0
|
作者
Upama, Paramita Basak [1 ]
Tanny, Nawshin Tabassum [1 ]
Akhter, Shahin [2 ]
机构
[1] Eastern Univ, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] Bangladesh Univ Engn & Technol, Inst Informat & Commun Technol, Dhaka, Bangladesh
关键词
Protein; Subcellular localization; Support Vector Machine; Machine Learning; Feature Selection; LOCATIONS;
D O I
10.1109/WIECON-ECE52138.2020.9397976
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The method of identifying the location of protein within a cell is called subcellular localization of proteins. This area of research in Bioinformatics is pivotal for protein synthesis and drug discovery of several medical conditions and diseases. This paper introduces a new machine learning approach for subcellular localization of proteins, which used 18 basic and physicochemical features novel for such methods. A model with support vector machine (SVM) was developed at first to learn these properties of proteins from 6 locations inside a cell, and then test the model on another independent set of protein sequences. The proposed multi-class classification algorithm achieved an accuracy of about 94%. The results show superior performance with minimal computations when compared to similar algorithms in the literature.
引用
收藏
页码:98 / 101
页数:4
相关论文
共 50 条
  • [21] Prediction of apoptosis protein subcellular localization via heterogeneous features and hierarchical extreme learning machine
    Zhang, S.
    Zhang, T.
    Liu, C.
    [J]. SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2019, 30 (03) : 209 - 228
  • [22] Feature Fusion Based SVM Classifier for Protein Subcellular Localization Prediction
    Rahman, Julia
    Mondal, Md. Nazrul Islam
    Ben Islam, Md. Khaled
    Al Mehedi, Md.
    [J]. JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2016, 13 (01) : 288
  • [23] Prediction of protein subcellular localization
    Yu, Chin-Sheng
    Chen, Yu-Ching
    Lu, Chih-Hao
    Hwang, Jenn-Kang
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 64 (03) : 643 - 651
  • [24] Using motifs in the prediction of eukaryotic protein subcellular localization
    Xie, Dan
    Li, Ao
    Lin, Xiaojun
    Wang, Minghui
    Jiang, Zhaohui
    Feng, Huanqing
    [J]. 2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 2802 - 2804
  • [25] Improvement of MicroRNA Target Prediction Using An Enhanced Feature Set: A Machine Learning Approach
    Mitra, Ramkrishna
    Bandyopadhyay, Sanghamitra
    [J]. 2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3, 2009, : 428 - 433
  • [26] mGOF-loc: A novel ensemble learning method for human protein subcellular localization prediction
    Wei, Leyi
    Liao, Minghong
    Gao, Xing
    Wang, Jingjing
    Lin, Weiqi
    [J]. NEUROCOMPUTING, 2016, 217 : 73 - 82
  • [27] A novel algorithm combining support vector machine with the discrete wavelet transform for the prediction of protein subcellular localization
    Liang, Ru-Ping
    Huang, Shu-Yun
    Shi, Shao-Ping
    Sun, Xing-Yu
    Suo, Sheng-Bao
    Qiu, Jian-Ding
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2012, 42 (02) : 180 - 187
  • [28] A comparative study on feature extraction from protein sequences for subcellular localization prediction
    Yang, Wen-Yun
    Lu, Bao-Liang
    Yang, Yang
    [J]. PROCEEDINGS OF THE 2006 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2006, : 201 - +
  • [29] Prediction of protein subcellular location using a combined feature of sequence
    Gao, QB
    Wang, ZZ
    Yan, C
    Du, YH
    [J]. FEBS LETTERS, 2005, 579 (16): : 3444 - 3448
  • [30] Protein subcellular localization prediction tools
    Gillani, Maryam
    Pollastri, Gianluca
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2024, 23 : 1796 - 1807