Identification of Intrinsically Disordered Proteins and Regions by Length-Dependent Predictors Based on Conditional Random Fields

被引:13
|
作者
Liu, Yumeng [1 ]
Chen, Shengyu [2 ]
Wang, Xiaolong [1 ]
Liu, Bin [1 ,3 ,4 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen 518055, Peoples R China
[2] Indiana Univ, Sch Informat Comp & Engn, Bloomington, IN 47408 USA
[3] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100081, Peoples R China
[4] Beijing Inst Technol, Adv Res Inst Multidisciplinary Sci, Beijing 100081, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
ACCURATE PREDICTION; UNSTRUCTURED REGIONS; SEQUENCE; SERVER;
D O I
10.1016/j.omtn.2019.06.004
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Accurate identification of intrinsically disordered proteins/regions (IDPs/IDRs) is critical for predicting protein structure and function. Previous studies have shown that IDRs of different lengths have different characteristics, and several classification-based predictors have been proposed for predicting different types of IDRs. Compared with these classification-based predictors, the previously proposed predictor IDP-CRF exhibits state-of-the-art performance for predicting IDPs/IDRs, which is a sequence labeling model based on conditional random fields (CRFs). Motivated by these methods, we propose a predictor called IDP-FSP, which is an ensemble of three CRF-based predictors called IDP-FSP-L, IDP-FSP-S, and IDP-FSP-G. These three predictors are specially designed to predict long, short, and generic disordered regions, respectively, and they are constructed based on different features. To the best of our knowledge, IDP-FSP is the first predictor that combines a sequence labeling algorithm with IDRs of different lengths. Experimental results using two independent test datasets show that IDP-FSP achieves better or at least comparable predictive performance with 26 existing state-of-the-art methods in this field, proving the effectiveness of IDP-FSP.
引用
收藏
页码:396 / 404
页数:9
相关论文
共 31 条
  • [1] Length-dependent compaction of intrinsically disordered proteins
    Uversky, Vladimir N.
    Santambrogio, Carlo
    Brocca, Stefania
    Grandori, Rita
    FEBS LETTERS, 2012, 586 (01) : 70 - 73
  • [2] IDP-CRF: Intrinsically Disordered Protein/Region Identification Based on Conditional Random Fields
    Liu, Yumeng
    Wang, Xiaolong
    Liu, Bin
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2018, 19 (09)
  • [3] Identification of Disordered Regions of Intrinsically Disordered Proteins by Multi-features Fusion
    Canzhuang, Sun
    Yonge, Feng
    CURRENT BIOINFORMATICS, 2021, 16 (09) : 1126 - 1132
  • [4] Prediction of the Disordered Regions of Intrinsically Disordered Proteins Based on the Molecular Functions
    Xie, WeiXia
    Feng, Yong E.
    PROTEIN AND PEPTIDE LETTERS, 2020, 27 (04): : 279 - 286
  • [5] An assignment of intrinsically disordered regions of proteins based on NMR structures
    Ota, Motonori
    Koike, Ryotaro
    Amemiya, Takayuki
    Tenno, Takeshi
    Romero, Pedro R.
    Hiroaki, Hidekazu
    Dunker, A. Keith
    Fukuchi, Satoshi
    JOURNAL OF STRUCTURAL BIOLOGY, 2013, 181 (01) : 29 - 36
  • [6] Identification and Prediction of Intrinsically Disordered Regions in Proteins Using n-grams
    Oberti, Mauricio
    Vaisman, Iosif I.
    ACM-BCB' 2017: PROCEEDINGS OF THE 8TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY,AND HEALTH INFORMATICS, 2017, : 67 - 72
  • [7] Domain dependent word segmentation based on conditional random fields
    Fukuda, Takuya
    Izumi, Masataka
    Miura, Takao
    2007 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING, VOLS 1 AND 2, 2007, : 264 - 267
  • [8] Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins
    Basu, Sankar
    Soderquist, Fredrik
    Wallner, Bjorn
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2017, 31 (05) : 453 - 466
  • [9] IDRWalker: A Random Walk Based Tool for Generating Intrinsically Disordered Regions in Large Protein Complexes
    Chen, Guanglin
    Zhang, Zhiyong
    ACS OMEGA, 2024, 9 (29): : 32059 - 32065
  • [10] Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins
    Sankar Basu
    Fredrik Söderquist
    Björn Wallner
    Journal of Computer-Aided Molecular Design, 2017, 31 : 453 - 466