Identification of DNA-binding proteins by Kernel Sparse Representation via L2,1-matrix norm

被引:2
|
作者
Ming, Yutong [1 ]
Liu, Hongzhi [1 ]
Cui, Yizhi [1 ]
Guo, Shaoyong [3 ]
Ding, Yijie [2 ]
Liu, Ruijun [1 ]
机构
[1] Beijing Technol & Business Univ, Sch Comp Sci & Engn, Beijing, Peoples R China
[2] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Quzhou, Quzhou, Zhejiang, Peoples R China
[3] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
DNA -binding proteins; Machine learning; Kernel sparse representation -based classifica; tion; L; 2; 1-matrix norm; Evolutionary information features; AMINO-ACID-COMPOSITION; FACE RECOGNITION; PREDICTION; PSEAAC; INFORMATION;
D O I
10.1016/j.compbiomed.2023.106849
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
An understanding of DNA-binding proteins is helpful in exploring the role that proteins play in cell biology. Furthermore, the prediction of DNA-binding proteins is essential for the chemical modification and structural composition of DNA, and is of great importance in protein functional analysis and drug design. In recent years, DNA-binding protein prediction has typically used machine learning-based methods. The prediction accuracy of various classifiers has improved considerably, but researchers continue to spend time and effort on improving prediction performance. In this paper, we combine protein sequence evolutionary information with a classification method based on kernel sparse representation for the prediction of DNA-binding proteins, and based on the field of machine learning, a model for the identification of DNA-binding proteins by sequence information was finally proposed. Based on the confirmation of the final experimental results, we achieved good prediction accuracy on both the PDB1075 and PDB186 datasets. Our training result for cross-validation on PDB1075 was 81.37%, and our independent test result on PDB186 was 83.9%, both of which outperformed the other methods to some extent. Therefore, the proposed method in this paper is proven to be effective and feasible for predicting DNA-binding proteins.
引用
收藏
页数:9
相关论文
共 40 条
  • [21] Random Fourier features-based sparse representation classifier for identifying DNA-binding proteins
    Guo, Xiaoyi
    Tiwari, Prayag
    Zhang, Ying
    Han, Shuguang
    Wang, Yansu
    Ding, Yijie
    Computers in Biology and Medicine, 2022, 151
  • [22] Random Fourier features-based sparse representation classifier for identifying DNA-binding proteins
    Guo, Xiaoyi
    Tiwari, Prayag
    Zhang, Ying
    Han, Shuguang
    Wang, Yansu
    Ding, Yijie
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 151
  • [23] Laplacian Regularized Sparse Representation Based Classifier for Identifying DNA N4-Methylcytosine Sites via L2, 1/2-Matrix Norm
    Ding, Yijie
    He, Wenying
    Tang, Jijun
    Zou, Quan
    Guo, Fei
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (01) : 500 - 511
  • [24] IDENTIFICATION OF THE PROTEINS RESPONSIBLE FOR SAR DNA-BINDING IN MATRIX OF CUCURBITA-PEPO
    RZEPECKI, R
    MARKIEWICZ, E
    SZOPA, J
    ACTA BIOCHIMICA POLONICA, 1995, 42 (02) : 171 - 176
  • [25] L2,1-Norm Regularized Robust and Sparse Linear Discriminant Analysis via an Alternating Direction Method of Multipliers
    Li, Chun-Na
    Li, Yi
    Meng, Yan-Hui
    Ren, Pei-Wei
    Shao, Yuan-Hai
    IEEE ACCESS, 2023, 11 : 34250 - 34259
  • [26] Assessing Dry Weight of Hemodialysis Patients via Sparse Laplacian Regularized RVFL Neural Network with L2,1-Norm
    Guo, Xiaoyi
    Zhou, Wei
    Lu, Qun
    Du, Aiyan
    Cai, Yinghua
    Ding, Yijie
    BIOMED RESEARCH INTERNATIONAL, 2021, 2021
  • [27] FTWSVM-SR: DNA-Binding Proteins Identification via Fuzzy Twin Support Vector Machines on Self-Representation
    Yi Zou
    Yijie Ding
    Li Peng
    Quan Zou
    Interdisciplinary Sciences: Computational Life Sciences, 2022, 14 : 372 - 384
  • [28] FTWSVM-SR: DNA-Binding Proteins Identification via Fuzzy Twin Support Vector Machines on Self-Representation
    Zou, Yi
    Ding, Yijie
    Peng, Li
    Zou, Quan
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2022, 14 (02) : 372 - 384
  • [29] Identification of DNA N4-methylcytosine Sites via Multiview Kernel Sparse Representation Model
    Ai C.
    Tiwari P.
    Yang H.
    Ding Y.
    Tang J.
    Guo F.
    IEEE Transactions on Artificial Intelligence, 2023, 4 (05): : 1236 - 1245
  • [30] Identification of DNA-binding proteins via Multi-view LSSVM with independence criterion
    Zhao, Shulin
    Zhang, Yu
    Ding, Yijie
    Zou, Quan
    Tang, Lijia
    Liu, Qing
    Zhang, Ying
    METHODS, 2022, 207 : 29 - 37