IDPpred: a new sequence-based predictor for identification of intrinsically disordered protein with enhanced accuracy

被引:1
|
作者
Chaurasiya, Deepak [1 ]
Mondal, Rajkrishna [2 ]
Lahiri, Tapobrata [1 ]
Tripathi, Asmita [1 ]
Ghinmine, Tejas [1 ]
机构
[1] Indian Inst Informat Technol, Dept Appl Sci, Prayagraj, Uttar Pradesh, India
[2] Nagaland Univ, Dept Biotechnol, Dimapur, Nagaland, India
来源
关键词
Intrinsically disordered protein; numerical representation of sequence; periodicity count value and predictor; REGIONS;
D O I
10.1080/07391102.2023.2290615
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Discovery of intrinsically disordered proteins (IDPs) and protein hybrids that contain both intrinsically disordered protein regions (IDPRs) along with ordered regions has changed the sequence-structure-function paradigm of protein. These proteins with lack of persistently fixed structure are often found in all organisms and play vital roles in various biological processes. Some of them are considered as potential drug targets due to their overrepresentation in pathophysiological processes. The major bottlenecks for characterizing such proteins are their occasional overexpression, difficulty in getting purified homogeneous form and the challenge of investigating them experimentally. Sequence-based prediction of intrinsic disorder remains a useful strategy especially for many large-scale proteomic investigations. However, worst accuracy still occurs for short disordered regions with less than ten residues, for the residues close to order-disorder boundaries, for regions that undergo coupled folding and binding in presence of partner, and for prediction of fully disordered proteins. Annotation of fully disordered proteins mostly relies on the far-UV circular dichroism experiment which gives overall secondary structure composition without residue-level resolution. Current methods including that using secondary structure information failed to predict half of target IDPs correctly in the recent Critical Assessment of protein Intrinsic Disorder prediction (CAID) experiment. This study utilized profiles of random sequential appearance of physicochemical properties of amino acids and random sequential appearance of order and disorder promoting amino acids in protein together with the existing CIDER feature for the prediction of IDP from sequence input. Our method was found to significantly outperform the existing predictors across different datasets.
引用
收藏
页码:957 / 965
页数:9
相关论文
共 50 条
  • [1] DeepCLD: An Efficient Sequence-Based Predictor of Intrinsically Disordered Proteins
    Fang, Min
    He, Yufeng
    Du, Zhihua
    Uversky, Vladimir N.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) : 3154 - 3159
  • [2] Sequence-Based Fingerprinting of Intrinsically Disordered Regions
    Ginell, Garrett M.
    Cohan, Megan C.
    Holehouse, Alex S.
    BIOPHYSICAL JOURNAL, 2019, 116 (03) : 179A - 179A
  • [3] Integrative Sequence-Based Classification of Intrinsically Disordered Regions
    Ginell, Garrett M.
    Lalmansingh, Jared
    Cohan, Megan C.
    Holehouse, Alex S.
    BIOPHYSICAL JOURNAL, 2020, 118 (03) : 217A - 217A
  • [4] Towards Decoding the Sequence-Based Grammar Governing the Functions of Intrinsically Disordered Protein Regions
    Chong, Shasha
    Mir, Mustafa
    JOURNAL OF MOLECULAR BIOLOGY, 2021, 433 (12)
  • [5] APOD: accurate sequence-based predictor of disordered flexible linkers
    Peng, Zhenling
    Xing, Qian
    Kurgan, Lukasz
    BIOINFORMATICS, 2020, 36 : I754 - I761
  • [6] Will my protein crystallize? A sequence-based predictor
    Smialowski, P
    Schmidt, T
    Cox, J
    Kirschner, A
    Frishman, D
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 62 (02) : 343 - 355
  • [7] A Sequence-based Approach for Predicting Protein Disordered Regions
    Huang, Tao
    He, Zhi-Song
    Cui, Wei-Ren
    Cai, Yu-Dong
    Shi, Xiao-He
    Hu, Le-Le
    Chou, Kuo-Chen
    PROTEIN AND PEPTIDE LETTERS, 2013, 20 (03): : 243 - 248
  • [8] SeBPPI: A Sequence-Based Protein-Protein Binding Predictor
    Wang, Bo
    Mao, Jun
    Wei, Min
    Qi, Yifei
    Zhang, John Z. H.
    JOURNAL OF COMPUTATIONAL BIOPHYSICS AND CHEMISTRY, 2022, 21 (06): : 729 - 737
  • [9] BCrystal: an interpretable sequence-based protein crystallization predictor
    Elbasir, Abdurrahman
    Mall, Raghvendra
    Kunji, Khalid
    Rawi, Reda
    Islam, Zeyaul
    Chuang, Gwo-Yu
    Kolatkar, Prasanna R.
    Bensmail, Halima
    BIOINFORMATICS, 2020, 36 (05) : 1429 - 1438
  • [10] Preparation of Bioderived and Biodegradable Surfactants Based on an Intrinsically Disordered Protein Sequence
    Klass, Sarah H.
    Gleason, Jamie M.
    Omole, Anthony O.
    Onoa, Bibiana
    Bustamante, Carlos J.
    Francis, Matthew B.
    BIOMACROMOLECULES, 2022, 23 (03) : 1462 - 1470