共 50 条
Interpretation of allele-specific chromatin accessibility using cell state-aware deep learning
被引:24
|作者:
Atak, Zeynep Kalender
[1
,2
,5
]
Taskiran, Ibrahim Ihsan
[1
,2
]
Demeulemeester, Jonas
[1
,2
,3
]
Flerin, Christopher
[1
,2
]
Mauduit, David
[1
,2
]
Minnoye, Liesbeth
[1
,2
]
Hulselmans, Gert
[1
,2
]
Christiaens, Valerie
[1
,2
]
Ghanem, Ghanem-Elias
[4
]
Wouters, Jasper
[1
,2
]
Aerts, Stein
[1
,2
]
机构:
[1] VIB KU Leuven Ctr Brain & Dis Res, B-3000 Leuven, Belgium
[2] Katholieke Univ Leuven, Dept Human Genet, B-3000 Leuven, Belgium
[3] Francis Crick Inst, Canc Genom Lab, London NW1 1AT, England
[4] Univ Libre Bruxelles, Inst Jules Bordet, B-1000 Brussels, Belgium
[5] Univ Cambridge, Canc Res UK Cambridge Inst, Cambridge CB2 0RE, England
基金:
欧洲研究理事会;
关键词:
TERT PROMOTER MUTATIONS;
FACTOR-DNA-BINDING;
REGULATORY ELEMENTS;
EXPRESSION;
TRANSCRIPTION;
VARIANTS;
IDENTIFICATION;
ZEB;
ARCHITECTURE;
NETWORK;
D O I:
10.1101/gr.260851.120
中图分类号:
Q5 [生物化学];
Q7 [分子生物学];
学科分类号:
071010 ;
081704 ;
摘要:
Genomic sequence variation within enhancers and promoters can have a significant impact on the cellular state and phenotype. However, sifting through the millions of candidate variants in a personal genome or a cancer genome, to identify those that impact cis-regulatory function, remains a major challenge. Interpretation of noncoding genome variation benefits from explainable artificial intelligence to predict and interpret the impact of a mutation on gene regulation. Here we generate phased whole genomes with matched chromatin accessibility, histone modifications, and gene expression for 10 melanoma cell lines. We find that training a specialized deep learning model, called DeepMEL2, on melanoma chromatin accessibility data can capture the various regulatory programs of the melanocytic and mesenchymal-like melanoma cell states. This model outperforms motif-based variant scoring, as well as more generic deep learning models. We detect hundreds to thousands of allele-specific chromatin accessibility variants (ASCAVs) in each melanoma genome, of which 15%-20% can be explained by gains or losses of transcription factor binding sites. A considerable fraction of ASCAVs are caused by changes in AP-1 binding, as confirmed by matched ChIP-seq data to identify allele-specific binding of JUN and FOSL1. Finally, by augmenting the DeepMEL2 model with ChIP-seq data for GABPA, the TERT promoter mutation, as well as additional ETS motif gains, can be identified with high confidence. In conclusion, we present a new integrative genomics approach and a deep learning model to identify and interpret functional enhancer mutations with allelic imbalance of chromatin accessibility and gene expression.
引用
收藏
页码:1082 / 1096
页数:16
相关论文