iNuc-PhysChem: A Sequence-Based Predictor for Identifying Nucleosomes via Physicochemical Properties

被引:185
|
作者
Chen, Wei [1 ,5 ]
Lin, Hao [2 ]
Feng, Peng-Mian [3 ]
Ding, Chen [2 ]
Zuo, Yong-Chun [4 ]
Chou, Kuo-Chen [5 ]
机构
[1] Hebei United Univ, Dept Phys, Sch Sci, Ctr Genom & Computat Biol, Tangshan, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Life Sci & Technol, Key Lab Neuroinformat, Minist Educ,Ctr Bioinformat, Chengdu 610054, Peoples R China
[3] Hebei United Univ, Sch Publ Hlth, Tangshan, Peoples R China
[4] Inner Mongolia Univ, Natl Res Ctr Anim Transgen Biotechnol, Hohhot, Peoples R China
[5] Gordon Life Sci Inst, San Diego, CA USA
来源
PLOS ONE | 2012年 / 7卷 / 10期
关键词
AMINO-ACID-COMPOSITION; PROTEIN STRUCTURAL CLASS; MODIFIED MAHALANOBIS DISCRIMINANT; SUBCELLULAR LOCATION PREDICTION; HIV-1; REVERSE-TRANSCRIPTASE; HUMAN GENOME; SIGNAL PEPTIDES; ENZYME-KINETICS; HIGH-RESOLUTION; GRAPHIC RULES;
D O I
10.1371/journal.pone.0047843
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Nucleosome positioning has important roles in key cellular processes. Although intensive efforts have been made in this area, the rules defining nucleosome positioning is still elusive and debated. In this study, we carried out a systematic comparison among the profiles of twelve DNA physicochemical features between the nucleosomal and linker sequences in the Saccharomyces cerevisiae genome. We found that nucleosomal sequences have some position-specific physicochemical features, which can be used for in-depth studying nucleosomes. Meanwhile, a new predictor, called iNuc-PhysChem, was developed for identification of nucleosomal sequences by incorporating these physicochemical properties into a 1788-D (dimensional) feature vector, which was further reduced to a 884-D vector via the IFS (incremental feature selection) procedure to optimize the feature set. It was observed by a cross-validation test on a benchmark dataset that the overall success rate achieved by iNuc-PhysChem was over 96% in identifying nucleosomal or linker sequences. As a web-server, iNuc-PhysChem is freely accessible to the public at http://lin.uestc.edu.cn/server/iNuc-PhysChem. For the convenience of the vast majority of experimental scientists, a step-by-step guide is provided on how to use the web-server to get the desired results without the need to follow the complicated mathematics that were presented just for the integrity in developing the predictor. Meanwhile, for those who prefer to run predictions in their own computers, the predictor's code can be easily downloaded from the web-server. It is anticipated that iNuc-PhysChem may become a useful high throughput tool for both basic research and drug design.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] M6APred-EL: A Sequence-Based Predictor for Identifying N6-methyladenosine Sites Using Ensemble Learning
    Wei, Leyi
    Chen, Huangrong
    Su, Ran
    MOLECULAR THERAPY-NUCLEIC ACIDS, 2018, 12 : 635 - 644
  • [32] A sequence-based multiple kernel model for identifying DNA-binding proteins
    Yuqing Qian
    Limin Jiang
    Yijie Ding
    Jijun Tang
    Fei Guo
    BMC Bioinformatics, 22
  • [33] A sequence-based multiple kernel model for identifying DNA-binding proteins
    Qian, Yuqing
    Jiang, Limin
    Ding, Yijie
    Tang, Jijun
    Guo, Fei
    BMC BIOINFORMATICS, 2021, 22 (SUPPL 3)
  • [34] ProteDNA: a sequence-based predictor of sequence-specific DNA-binding residues in transcription factors
    Chu, Wen-Yi
    Huang, Yu-Feng
    Huang, Chun-Chin
    Cheng, Yi-Sheng
    Huang, Chien-Kang
    Oyang, Yen-Jen
    NUCLEIC ACIDS RESEARCH, 2009, 37 : W396 - W401
  • [35] SeqSVM: A Sequence-Based Support Vector Machine Method for Identifying Antioxidant Proteins
    Xu, Lei
    Liang, Guangmin
    Shi, Shuhua
    Liao, Changrui
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2018, 19 (06):
  • [36] POSEIDON: Peptidic Objects SEquence-based Interaction with cellular DOmaiNs: a new database and predictor
    Preto, Antonio J.
    Caniceiro, Ana B.
    Duarte, Francisco
    Fernandes, Hugo
    Ferreira, Lino
    Mourao, Joana
    Moreira, Irina S.
    JOURNAL OF CHEMINFORMATICS, 2024, 16 (01)
  • [37] IDPpred: a new sequence-based predictor for identification of intrinsically disordered protein with enhanced accuracy
    Chaurasiya, Deepak
    Mondal, Rajkrishna
    Lahiri, Tapobrata
    Tripathi, Asmita
    Ghinmine, Tejas
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2025, 43 (02): : 957 - 965
  • [38] DSResSol: A Sequence-Based Solubility Predictor Created with Dilated Squeeze Excitation Residual Networks
    Madani, Mohammad
    Lin, Kaixiang
    Tarakanova, Anna
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2021, 22 (24)
  • [39] HyperCys: A Structure- and Sequence-Based Predictor of Hyper-Reactive Druggable Cysteines
    Gao, Mingjie
    Guenther, Stefan
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (06)
  • [40] POSEIDON: Peptidic Objects SEquence-based Interaction with cellular DOmaiNs: a new database and predictor
    António J. Preto
    Ana B. Caniceiro
    Francisco Duarte
    Hugo Fernandes
    Lino Ferreira
    Joana Mourão
    Irina S. Moreira
    Journal of Cheminformatics, 16