iNuc-PhysChem: A Sequence-Based Predictor for Identifying Nucleosomes via Physicochemical Properties

被引:185
|
作者
Chen, Wei [1 ,5 ]
Lin, Hao [2 ]
Feng, Peng-Mian [3 ]
Ding, Chen [2 ]
Zuo, Yong-Chun [4 ]
Chou, Kuo-Chen [5 ]
机构
[1] Hebei United Univ, Dept Phys, Sch Sci, Ctr Genom & Computat Biol, Tangshan, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Life Sci & Technol, Key Lab Neuroinformat, Minist Educ,Ctr Bioinformat, Chengdu 610054, Peoples R China
[3] Hebei United Univ, Sch Publ Hlth, Tangshan, Peoples R China
[4] Inner Mongolia Univ, Natl Res Ctr Anim Transgen Biotechnol, Hohhot, Peoples R China
[5] Gordon Life Sci Inst, San Diego, CA USA
来源
PLOS ONE | 2012年 / 7卷 / 10期
关键词
AMINO-ACID-COMPOSITION; PROTEIN STRUCTURAL CLASS; MODIFIED MAHALANOBIS DISCRIMINANT; SUBCELLULAR LOCATION PREDICTION; HIV-1; REVERSE-TRANSCRIPTASE; HUMAN GENOME; SIGNAL PEPTIDES; ENZYME-KINETICS; HIGH-RESOLUTION; GRAPHIC RULES;
D O I
10.1371/journal.pone.0047843
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Nucleosome positioning has important roles in key cellular processes. Although intensive efforts have been made in this area, the rules defining nucleosome positioning is still elusive and debated. In this study, we carried out a systematic comparison among the profiles of twelve DNA physicochemical features between the nucleosomal and linker sequences in the Saccharomyces cerevisiae genome. We found that nucleosomal sequences have some position-specific physicochemical features, which can be used for in-depth studying nucleosomes. Meanwhile, a new predictor, called iNuc-PhysChem, was developed for identification of nucleosomal sequences by incorporating these physicochemical properties into a 1788-D (dimensional) feature vector, which was further reduced to a 884-D vector via the IFS (incremental feature selection) procedure to optimize the feature set. It was observed by a cross-validation test on a benchmark dataset that the overall success rate achieved by iNuc-PhysChem was over 96% in identifying nucleosomal or linker sequences. As a web-server, iNuc-PhysChem is freely accessible to the public at http://lin.uestc.edu.cn/server/iNuc-PhysChem. For the convenience of the vast majority of experimental scientists, a step-by-step guide is provided on how to use the web-server to get the desired results without the need to follow the complicated mathematics that were presented just for the integrity in developing the predictor. Meanwhile, for those who prefer to run predictions in their own computers, the predictor's code can be easily downloaded from the web-server. It is anticipated that iNuc-PhysChem may become a useful high throughput tool for both basic research and drug design.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] A Novel Hybrid Sequence-Based Model for Identifying Anticancer Peptides
    Xu, Lei
    Liang, Guangmin
    Wang, Longjie
    Liao, Changrui
    GENES, 2018, 9 (03)
  • [22] SeqPoint: Identifying Representative Iterations of Sequence-based Neural Networks
    Pati, Suchita
    Aga, Shaizeen
    Sinclair, Matthew D.
    Jayasena, Nuwan
    2020 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2020, : 69 - 80
  • [23] GPApred: The first computational predictor for identifying proteins with LPXTG-like motif using sequence-based optimal features
    Malik, Adeel
    Shoombuatong, Watshara
    Kim, Chang-Bae
    Manavalan, Balachandran
    INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2023, 229 : 529 - 538
  • [24] iRNA-2OM: A Sequence-Based Predictor for Identifying 2′-O-Methylation Sites in Homo sapiens
    Yang, Hui
    Lv, Hao
    Ding, Hui
    Chen, Wei
    Lin, Hao
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2018, 25 (11) : 1266 - 1277
  • [25] iEnhancer-XG: interpretable sequence-based enhancers and their strength predictor
    Cai, Lijun
    Ren, Xuanbai
    Fu, Xiangzheng
    Peng, Li
    Gao, Mingyu
    Zeng, Xiangxiang
    BIOINFORMATICS, 2021, 37 (08) : 1060 - 1067
  • [26] iTIS-PseTNC: A sequence-based predictor for identifying translation initiation site in human genes using pseudo trinucleotide composition
    Chen, Wei
    Feng, Peng-Mian
    Deng, En-Ze
    Lin, Hao
    Chou, Kuo-Chen
    ANALYTICAL BIOCHEMISTRY, 2014, 462 : 76 - 83
  • [27] Mining of Gram-Negative Surface-Active Enzybiotic Candidates by Sequence-Based Calculation of Physicochemical Properties
    Vazquez, Roberto
    Blanco-Ganan, Sofia
    Ruiz, Susana
    Garcia, Pedro
    FRONTIERS IN MICROBIOLOGY, 2021, 12
  • [28] iQSP: A Sequence-Based Tool for the Prediction and Analysis of Quorum Sensing Peptides via Chou's 5-Steps Rule and Informative Physicochemical Properties
    Charoenkwan, Phasit
    Schaduangrat, Nalini
    Nantasenamat, Chanin
    Piacham, Theeraphon
    Shoombuatong, Watshara
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2020, 21 (01)
  • [29] GGIP: Structure and sequence-based GPCR-GPCR interaction pair predictor
    Nemoto, Wataru
    Yamanishi, Yoshihiro
    Limviphuvadh, Vachiranee
    Saito, Akira
    Toh, Hiroyuki
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2016, 84 (09) : 1224 - 1233
  • [30] CLPred: a sequence-based protein crystallization predictor using BLSTM neural network
    Xuan, Wenjing
    Liu, Ning
    Huang, Neng
    Li, Yaohang
    Wang, Jianxin
    BIOINFORMATICS, 2020, 36 : I709 - I717