Improved DNA-Binding Protein Identification by Incorporating Evolutionary Information Into the Chou's PseAAC
Cited by: 30
Authors:
Fu, Xiangzheng [1]
Zhu, Wen [1,2]
Liao, Bo [1,2]
Cai, Lijun [1]
Peng, Lihong [3]
Yang, Jialiang [2,4]
Affiliations:
[1] Hunan Univ, Coll Informat Sci & Engn, Changsha 410082, Hunan, Peoples R China
[2] Hainan Normal Univ, Sch Math & Stat, Haikou 570100, Peoples R China
[3] Hunan Univ Technol, Sch Comp Sci, Zhuzhou 412007, Peoples R China
[4] Icahn Sch Med Mt Sinai, Icahn Inst Genom & Multiscale Biol, New York, NY 10029 USA
Source:
IEEE Access, 2018, Vol. 6
Keywords:
DNA-binding protein identification;
feature representation algorithm;
evolutionary information;
support vector machine;
AMINO-ACID-COMPOSITION;
PREDICT SUBCELLULAR-LOCALIZATION;
ENSEMBLE CLASSIFIER;
WEB SERVER;
SEQUENCE;
SITES;
RNA;
BIOINFORMATICS;
GENERATION;
PROMOTERS;
DOI:
10.1109/ACCESS.2018.2876656
Chinese Library Classification (CLC) number:
TP [Automation technology, computer technology];
Discipline classification code:
0812;
Abstract:
DNA-binding proteins play critical roles in various cellular biological processes, such as gene expression and transcription. However, experimental methods for identifying these proteins, such as ChIP-sequencing, are expensive and time-consuming, which creates a need for in silico methods, especially machine learning-based ones. In recent years, the accuracy of machine learning-based DNA-binding protein prediction has increased significantly. However, some critical problems remain to be solved, such as how to convert protein sequences into an appropriate discrete model or vector. In this paper, we propose a novel feature construction method, named K-PSSM-Composition, based on the position-specific scoring matrix (PSSM). The proposed features efficiently capture information about the 20 amino acid residues and the local information of a given sequence during the evolutionary process. We perform recursive feature elimination to extract the optimal set of features, which is used to train a support vector machine model for predicting DNA-binding proteins. We evaluate our predictor and compare it with other advanced predictors on two standard benchmark data sets. The proposed method achieves accuracies of 89.77% and 88.71% on the jackknife test and the independent test, respectively, outperforming the compared methods. This finding demonstrates the effectiveness of the proposed method in predicting DNA-binding proteins.
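As a rough illustration of how a PSSM can be turned into a fixed-length feature vector, the sketch below computes a generic PSSM-composition style feature: the PSSM rows are grouped by the residue type at each position and averaged over the sequence length, giving 20 x 20 = 400 values. This is an assumption made for illustration only. The record does not specify how K-PSSM-Composition is defined, so the paper's actual features may differ. The function name `pssm_composition`, the alphabetical residue order, and the toy data are all hypothetical.

```python
# Illustrative sketch only: a generic PSSM-composition style feature,
# NOT the K-PSSM-Composition method from the paper (its definition is
# not given in this record).

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # assumed residue order for the 20 groups


def pssm_composition(sequence, pssm):
    """Group PSSM rows by residue type and average over sequence length.

    sequence: string of length L
    pssm:     list of L rows, each a list of 20 scores
    returns:  flat list of 20 * 20 = 400 features
    """
    if not sequence:
        raise ValueError("sequence must not be empty")
    if len(sequence) != len(pssm):
        raise ValueError("sequence and PSSM must have the same length")

    index = {aa: i for i, aa in enumerate(AMINO_ACIDS)}
    sums = [[0.0] * 20 for _ in range(20)]

    for residue, row in zip(sequence, pssm):
        if len(row) != 20:
            raise ValueError("each PSSM row must have 20 scores")
        if residue not in index:
            continue  # skip non-standard residues such as 'X'
        target = sums[index[residue]]
        for j, score in enumerate(row):
            target[j] += score

    length = len(sequence)
    # Flatten the 20 x 20 matrix row by row and normalise by length.
    return [value / length for row in sums for value in row]


# Toy data (hypothetical): a 3-residue sequence with constant PSSM rows.
toy_sequence = "ACA"
toy_pssm = [[1.0] * 20, [2.0] * 20, [3.0] * 20]
features = pssm_composition(toy_sequence, toy_pssm)
print(len(features))
```

A vector of this kind could then be passed to a feature selection step and a classifier, as the abstract describes, but the specific selection and training settings used by the authors are not stated here.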
Li, Chun
Affiliations:
Hainan Normal Univ, Sch Math & Stat, Haikou 571158, Hainan, Peoples R China
Bohai Univ, Dept Math, Jinzhou 121013, Peoples R China
Bohai Univ, Res Inst Food Sci, Jinzhou 121013, Peoples R China
Zhao, Jialing
Affiliations:
Bohai Univ, Dept Math, Jinzhou 121013, Peoples R China
Wang, Changzhong
Affiliations:
Bohai Univ, Dept Math, Jinzhou 121013, Peoples R China
Yao, Yuhua
Affiliations:
Hainan Normal Univ, Sch Math & Stat, Haikou 571158, Hainan, Peoples R China