Deep Learning Sequence Models for Transcriptional Regulation

被引:2
|
作者
Sokolova, Ksenia [1 ,2 ]
Chen, Kathleen M. [1 ,2 ]
Hao, Yun [3 ]
Zhou, Jian [4 ]
Troyanskaya, Olga G. [1 ,2 ,3 ,5 ]
机构
[1] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
[2] Princeton Univ, Lewis Sigler Inst Integrat Genom, Princeton, NJ 08544 USA
[3] Simons Fdn, Flatiron Inst, New York, NY USA
[4] Univ Texas Southwestern Med Ctr Dallas, Lyda Hill Dept Bioinformat, Dallas, TX 75390 USA
[5] Princeton Univ, Princeton Precis Hlth, Princeton, NJ 08544 USA
关键词
AI; machine learning; deep learning; genomics; sequence models; transcriptional regulation; RNA-BINDING; GENE-EXPRESSION; NEURAL-NETWORKS; DNA-SEQUENCE; CHROMATIN; PROMOTER; GENOME; PRINCIPLES; VARIANTS; REGIONS;
D O I
10.1146/annurev-genom-021623-024727
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Deciphering the regulatory code of gene expression and interpreting the transcriptional effects of genome variation are critical challenges in human genetics. Modern experimental technologies have resulted in an abundance of data, enabling the development of sequence-based deep learning models that link patterns embedded in DNA to the biochemical and regulatory properties contributing to transcriptional regulation, including modeling epigenetic marks, 3D genome organization, and gene expression, with tissue and cell-type specificity. Such methods can predict the functional consequences of any noncoding variant in the human genome, even rare or never-before-observed variants, and systematically characterize their consequences beyond what is tractable from experiments or quantitative genetics studies alone. Recently, the development and application of interpretability approaches have led to the identification of key sequence patterns contributing to the predicted tasks, providing insights into the underlying biological mechanisms learned and revealing opportunities for improvement in future models.
引用
收藏
页码:105 / 122
页数:18
相关论文
共 50 条
  • [1] Deep Reinforcement Learning for Sequence-to-Sequence Models
    Keneshloo, Yaser
    Shi, Tian
    Ramakrishnan, Naren
    Reddy, Chandan K.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (07) : 2469 - 2489
  • [2] A community effort to optimize sequence-based deep learning models of gene regulation
    Rafi, Abdul Muntakim
    Nogina, Daria
    Penzar, Dmitry
    Lee, Dohoon
    Lee, Danyeong
    Kim, Nayeon
    Kim, Sangyeup
    Kim, Dohyeon
    Shin, Yeojin
    Kwak, Il-Youp
    Meshcheryakov, Georgy
    Lando, Andrey
    Zinkevich, Arsenii
    Kim, Byeong-Chan
    Lee, Juhyun
    Kang, Taein
    Vaishnav, Eeshit Dhaval
    Yadollahpour, Payman
    Kim, Sun
    Albrecht, Jake
    Regev, Aviv
    Gong, Wuming
    Kulakovskiy, Ivan V.
    Meyer, Pablo
    de Boer, Carl G.
    Bornelov, Susanne
    Svensson, Fredrik
    Trapotsi, Maria-Anna
    Duc Tran
    Tin Nguyen
    Tu, Xinming
    Zhang, Wuwei
    Qiu, Wei
    Ghotra, Rohan
    Yu, Yiyang
    Labelson, Ethan
    Prakash, Aayush
    Narayanan, Ashwin
    Koo, Peter
    Chen, Xiaoting
    Jones, David T.
    Tinti, Michele
    Guan, Yuanfang
    Ding, Maolin
    Chen, Ken
    Yang, Yuedong
    Ding, Ke
    Dixit, Gunjan
    Wen, Jiayu
    Zhou, Zhihan
    NATURE BIOTECHNOLOGY, 2024,
  • [3] Modeling transcriptional regulation of model species with deep learning
    Cofer, Evan M.
    Raimundo, Joao
    Tadych, Alicja
    Yamazaki, Yuji
    Wong, Aaron K.
    Theesfeld, Chandra L.
    Levine, Michael S.
    Troyanskaya, Olga G.
    GENOME RESEARCH, 2021, 31 (06) : 1097 - 1105
  • [4] Thermodynamics-based models of transcriptional regulation with gene sequence
    Wang, Shuqiang
    Shen, Yanyan
    Hu, Jinxing
    BIOPROCESS AND BIOSYSTEMS ENGINEERING, 2015, 38 (12) : 2469 - 2476
  • [5] Thermodynamics-based models of transcriptional regulation with gene sequence
    Shuqiang Wang
    Yanyan Shen
    Jinxing Hu
    Bioprocess and Biosystems Engineering, 2015, 38 : 2469 - 2476
  • [6] Sequence to sequence deep learning models for solar irradiation forecasting
    Mukhoty, Bhaskar Pratim
    Maurya, Vikas
    Shukla, Sandeep Kumar
    2019 IEEE MILAN POWERTECH, 2019,
  • [7] Robot failure mode prediction with deep learning sequence models
    Khalil Damak
    Mariem Boujelbene
    Cagla Acun
    Aneseh Alvanpour
    Sumit K. Das
    Dan O. Popa
    Olfa Nasraoui
    Neural Computing and Applications, 2025, 37 (6) : 4291 - 4302
  • [8] Affibody Sequence Design Using Deep Learning Generative Models
    Wang, Zirui
    Mardikoraem, Mehrsa
    Woldring, Daniel
    PROTEIN SCIENCE, 2023, 32
  • [9] Transcriptional regulation by the numbers: models
    Bintu, L
    Buchler, NE
    Garcia, HG
    Gerland, U
    Hwa, T
    Kondev, J
    Phillips, R
    CURRENT OPINION IN GENETICS & DEVELOPMENT, 2005, 15 (02) : 116 - 124
  • [10] Residual Recurrent Highway Networks for Learning Deep Sequence Prediction Models
    Tehseen Zia
    Saad Razzaq
    Journal of Grid Computing, 2020, 18 : 169 - 176