Identification and analysis of consensus RNA motifs binding to the genome regulator CTCF

被引:10
|
作者
Kuang, Shuzhen [1 ,2 ]
Wang, Liangjiang [1 ]
机构
[1] Clemson Univ, Dept Genet & Biochem, Clemson, SC 29634 USA
[2] Clemson Univ, Dept Biol Sci, Clemson, SC 29634 USA
关键词
LONG NONCODING RNAS; CELL LUNG-CANCER; GENE-EXPRESSION; ORGANIZATION; PROGRESSION; PROTEINS; SEQUENCE; PROLIFERATION; TRANSCRIPTION; INTERACTOME;
D O I
10.1093/nargab/lqaa031
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
CCCTC-binding factor (CTCF) is a key regulator of 3D genome organization and gene expression. Recent studies suggest that RNA transcripts, mostly long non-coding RNAs (lncRNAs), can serve as locus-specific factors to bind and recruit CTCF to the chromatin. However, it remains unclear whether specific sequence patterns are shared by the CTCF-binding RNA sites, and no RNA motif has been reported so far for CTCF binding. In this study, we have developed DeepLncCTCF, a new deep learning model based on a convolutional neural network and a bidirectional long short-term memory network, to discover the RNA recognition patterns of CTCF and identify candidate lncRNAs binding to CTCF. When evaluated on two different datasets, human U2OS dataset and mouse ESC dataset, DeepLncCTCF was shown to be able to accurately predict CTCF-binding RNA sites from nucleotide sequence. By examining the sequence features learned by DeepLncCTCF, we discovered a novel RNA motif with the consensus sequence, AGAUNGGA, for potential CTCF binding in humans. Furthermore, the applicability of DeepLncCTCF was demonstrated by identifying nearly 5000 candidate lncRNAs that might bind to CTCF in the nucleus. Our results provide useful information for understanding the molecular mechanisms of CTCF function in 3D genome organization.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Sequence search and analysis of gene products containing RNA recognition motifs in the human genome
    Sony Malhotra
    Ramanathan Sowdhamini
    BMC Genomics, 15
  • [32] Sequence search and analysis of gene products containing RNA recognition motifs in the human genome
    Malhotra, Sony
    Sowdhamini, Ramanathan
    BMC GENOMICS, 2014, 15
  • [33] CTCFBSDB 2.0: a database for CTCF-binding sites and genome organization
    Ziebarth, Jesse D.
    Bhattacharya, Anindya
    Cui, Yan
    NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) : D188 - D194
  • [34] Very fast identification of RNA motifs in genomic DNA. Application to tRNA search in the yeast genome
    ElMabrouk, N
    Lisacek, F
    JOURNAL OF MOLECULAR BIOLOGY, 1996, 264 (01) : 46 - 55
  • [35] CHROMOSOME BIOLOGY CTCF-binding site orientation shapes the genome
    Zlotorynski, Eytan
    NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2015, 16 (10) : 578 - 578
  • [36] Predicting CTCF cell type active binding sites in human genome
    Chai, Lu
    Gao, Jie
    Li, Zihan
    Sun, Hao
    Liu, Junjie
    Wang, Yong
    Zhang, Lirong
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [37] Predictable cholesterol binding sites in GPCRs lack consensus motifs
    Taghon, Geoffrey J.
    Rowe, Jacob B.
    Kapolka, Nicholas J.
    Isom, Daniel G.
    STRUCTURE, 2021, 29 (05) : 499 - +
  • [38] CGGBP1-regulated cytosine methylation at CTCF-binding motifs resists stochasticity
    Manthan Patel
    Divyesh Patel
    Subhamoy Datta
    Umashankar Singh
    BMC Genetics, 21
  • [39] COMP 143-Mechanism of RNA binding by RNA recognition motifs
    Hall, Kathleen B.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2007, 234
  • [40] Identification of strong DNA binding motifs for bleomycin
    Akiyama, Yoshitsugu
    Ma, Qian
    Edgar, Erin
    Laikhter, Andrei
    Hecht, Sidney M.
    Journal of the American Chemical Society, 2008, 130 (30): : 9650 - 9651