Identification of DNA-binding proteins by Kernel Sparse Representation via L2,1-matrix norm

被引:1
|
作者
Ming, Yutong [1 ]
Liu, Hongzhi [1 ]
Cui, Yizhi [1 ]
Guo, Shaoyong [3 ]
Ding, Yijie [2 ]
Liu, Ruijun [1 ]
机构
[1] Beijing Technol & Business Univ, Sch Comp Sci & Engn, Beijing, Peoples R China
[2] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Quzhou, Quzhou, Zhejiang, Peoples R China
[3] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
DNA -binding proteins; Machine learning; Kernel sparse representation -based classifica; tion; L; 2; 1-matrix norm; Evolutionary information features; AMINO-ACID-COMPOSITION; FACE RECOGNITION; PREDICTION; PSEAAC; INFORMATION;
D O I
10.1016/j.compbiomed.2023.106849
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
An understanding of DNA-binding proteins is helpful in exploring the role that proteins play in cell biology. Furthermore, the prediction of DNA-binding proteins is essential for the chemical modification and structural composition of DNA, and is of great importance in protein functional analysis and drug design. In recent years, DNA-binding protein prediction has typically used machine learning-based methods. The prediction accuracy of various classifiers has improved considerably, but researchers continue to spend time and effort on improving prediction performance. In this paper, we combine protein sequence evolutionary information with a classification method based on kernel sparse representation for the prediction of DNA-binding proteins, and based on the field of machine learning, a model for the identification of DNA-binding proteins by sequence information was finally proposed. Based on the confirmation of the final experimental results, we achieved good prediction accuracy on both the PDB1075 and PDB186 datasets. Our training result for cross-validation on PDB1075 was 81.37%, and our independent test result on PDB186 was 83.9%, both of which outperformed the other methods to some extent. Therefore, the proposed method in this paper is proven to be effective and feasible for predicting DNA-binding proteins.
引用
收藏
页数:9
相关论文
共 39 条
  • [1] Sparse matrix factorization with L2,1 norm for matrix completion
    Jin, Xiaobo
    Miao, Jianyu
    Wang, Qiufeng
    Geng, Guanggang
    Huang, Kaizhu
    PATTERN RECOGNITION, 2022, 127
  • [2] Image Restoration via Group l2,1 Norm-Based Structural Sparse Representation
    Zhang, Kai Song
    Zhong, Luo
    Zhang, Xuan Ya
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (04)
  • [3] Discriminative Kernel Transfer Learning via l2,1- Norm Minimization
    Zhang, Lei
    Jha, Sunil Kr.
    Liu, Tao
    Pei, Guangshu
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 2220 - 2227
  • [4] Sparse Neighborhood Preserving Embedding via L2,1-Norm Minimization
    Zhou, Youpeng
    Ding, Yulin
    Luo, Yifu
    Ren, Haoliang
    PROCEEDINGS OF 2016 9TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2016, : 378 - 382
  • [5] Identification of Extracellular DNA-Binding Proteins in the Biofilm Matrix
    Kavanaugh, Jeffrey S.
    Flack, Caralyn E.
    Lister, Jessica
    Ricker, Erica B.
    Ibberson, Carolyn B.
    Jenul, Christian
    Moormeier, Derek E.
    Delmain, Elizabeth A.
    Bayles, Kenneth W.
    Horswill, Alexander R.
    MBIO, 2019, 10 (03):
  • [6] Unsupervised Discriminative Feature Selection in a Kernel Space via L2,1-Norm Minimization
    Liu, Yang
    Wang, Yizhou
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 1205 - 1208
  • [7] Adaptive Neighborhood Propagation by Joint L2,1-norm Regularized Sparse Coding for Representation and Classification
    Jia, Lei
    Zhang, Zhao
    Wang, Lei
    Jiang, Weiming
    Zhao, Mingbo
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 201 - 210
  • [8] Characteristic Gene Selection via L2,1-norm Sparse Principal Component Analysis
    Lu, Yao
    Ga, Ying-Lian
    Liu, Jin-Xing
    Wen, Chang-Gang
    Wang, Ya-Xuan
    Yu, Jiguo
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 1828 - 1833
  • [9] Identification of DNA-Binding Proteins via a Voting Strategy
    Zhang, Jun
    Liu, Bin
    CURRENT PROTEOMICS, 2018, 15 (05) : 363 - 373
  • [10] Face recognition by sparse discriminant analysis via joint L2,1-norm minimization
    Shi, Xiaoshuang
    Yang, Yujiu
    Guo, Zhenhua
    Lai, Zhihui
    PATTERN RECOGNITION, 2014, 47 (07) : 2447 - 2453