Deep learning identifies genome-wide DNA binding sites of long noncoding RNAs

被引:25
|
作者
Wang, Fan [1 ,2 ]
Chainani, Pranik [3 ]
White, Tommy [3 ]
Yang, Jin [1 ]
Liu, Yu [2 ]
Soibam, Benjamin [3 ]
机构
[1] Xi An Jiao Tong Univ, Dept Oncol, Affiliated Hosp 1, Xian, Shaanxi, Peoples R China
[2] Univ Houston, Dept Biol & Biochem, Houston, TX USA
[3] Univ Houston Downtown, Comp Sci & Engn Technol, Houston, TX 77002 USA
关键词
Long noncoding RNAs; deep learning; triplex; MEG3;
D O I
10.1080/15476286.2018.1551704
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Long noncoding RNAs (lncRNAs) can exert their function by interacting with the DNA via triplex structure formation. Even though this has been validated with a handful of experiments, a genome-wide analysis of lncRNA-DNA binding is needed. In this paper, we develop and interpret deep learning models that predict the genome-wide binding sites deciphered by ChIRP-Seq experiments of 12 different lncRNAs. Among the several deep learning architectures tested, a simple architecture consisting of two convolutional neural network layers performed the best suggesting local sequence patterns as determinants of the interaction. Further interpretation of the kernels in the model revealed that these local sequence patterns form triplex structures with the corresponding lncRNAs. We uncovered several novel triplexes forming domains (TFDs) of these 12 lncRNAs and previously experimentally verified TFDs of lncRNAs HOTAIR and MEG3. We experimentally verified such two novel TFDs of lncRNAs HOTAIR and TUG1 predicted by our method (but previously unreported) using Electrophoretic mobility shift assays. In conclusion, we show that simple deep learning architecture can accurately predict genome-wide binding sites of lncRNAs and interpretation of the models suggest RNA:DNA:DNA triplex formation as a viable mechanism underlying lncRNA-DNA interactions at genome-wide level.
引用
收藏
页码:1468 / 1476
页数:9
相关论文
共 50 条
  • [1] Genome-wide methods for investigating long noncoding RNAs
    Cao, Mei
    Zhao, Jian
    Hu, Guoku
    [J]. BIOMEDICINE & PHARMACOTHERAPY, 2019, 111 : 395 - 401
  • [2] RETRACTED: Genome-Wide Screening Identifies Prognostic Long Noncoding RNAs in Hepatocellular Carcinoma (Retracted Article)
    Feng, Yujie
    Hu, Xiao
    Ma, Kai
    Zhang, Bingyuan
    Sun, Chuandong
    [J]. BIOMED RESEARCH INTERNATIONAL, 2021, 2021
  • [3] Genome-Wide Differential Transcription of Long Noncoding RNAs in Psoriatic Skin
    Stacey, Valerie M.
    Koks, Sulev
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (22)
  • [4] Genome-Wide Analysis of Human SNPs at Long Intergenic Noncoding RNAs
    Chen, Geng
    Qiu, Chengxiang
    Zhang, Qipeng
    Liu, Bing
    Cui, Qinghua
    [J]. HUMAN MUTATION, 2013, 34 (02) : 338 - 344
  • [5] Genome-Wide Analysis of Human Long Noncoding RNAs: A Provocative Review
    Ponting, Chris P.
    Haerty, Wilfried
    [J]. ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, 2022, 23 : 153 - 172
  • [6] In Vivo Genome-Wide CRISPR Activation Screening Identifies Functionally Important Long Noncoding RNAs in Hepatocellular Carcinoma
    Wong, Lok-Sze
    Wei, Lai
    Wang, Gengchao
    Law, Cheuk-Ting
    Tsang, Felice Ho-Ching
    Chin, Wai-Ching
    Ng, Irene Oi-Lin
    Wong, Chun-Ming
    [J]. CELLULAR AND MOLECULAR GASTROENTEROLOGY AND HEPATOLOGY, 2022, 14 (05): : 1053 - 1076
  • [7] Detection of RNA-DNA binding sites in long noncoding RNAs
    Kuo, Chao-Chung
    Haenzelmann, Sonja
    Cetin, Nevcin Sentuerk
    Frank, Stefan
    Zajzon, Barna
    Derks, Jens-Peter
    Akhade, Vijay Suresh
    Ahuja, Gaurav
    Kanduri, Chandrasekhar
    Grummt, Ingrid
    Kurian, Leo
    Costa, Ivan G.
    [J]. NUCLEIC ACIDS RESEARCH, 2019, 47 (06)
  • [8] Genome-wide identification of Arabidopsis long noncoding RNAs in response to the blue light
    Zhenfei Sun
    Kai Huang
    Zujing Han
    Pan Wang
    Yuda Fang
    [J]. Scientific Reports, 10
  • [9] Genome-wide discovery and characterization of long noncoding RNAs in patients with multiple myeloma
    Lu, Minqiu
    Hu, Ying
    Wu, Yin
    Zhou, Huixing
    Jian, Yuan
    Tian, Ying
    Chen, Wenming
    [J]. BMC MEDICAL GENOMICS, 2019, 12 (01)
  • [10] Genome-wide discovery and characterization of long noncoding RNAs in patients with multiple myeloma
    Minqiu Lu
    Ying Hu
    Yin Wu
    Huixing Zhou
    Yuan Jian
    Ying Tian
    Wenming Chen
    [J]. BMC Medical Genomics, 12