The development of next-generation sequencing technologies has opened-up some new possibilities to explore the contribution of genetic variants to human diseases and in particular that of rare variants. Statistical methods have been developed to test for association with rare variants that require the definition of testing units and, in these testing units, the selection of qualifying variants to include in the test. In the coding regions of the genome, testing units are usually the different genes and qualifying variants are selected based on their functional effects on the encoded proteins. Extending these tests to the non-coding regions of the genome is challenging. Testing units are difficult to define as the non-coding genome organisation is still rather unknown. Qualifying variants are difficult to select as the functional impact of non-coding variants on gene expression is hard to predict. These difficulties could explain why very few investigators so far have analysed the non-coding parts of their whole genome sequencing data. These non-coding parts yet represent the vast majority of the genome and some studies suggest that they could play a major role in disease susceptibility. In this review, we discuss recent experimental and statistical developments to gain knowledge on the non-coding genome and how this knowledge could be used to include rare non-coding variants in association tests. We describe the few studies that have considered variants from the non-coding genome in association tests and how they managed to define testing units and select qualifying variants.
机构:
Univ Calif Berkeley, Dept Mol & Cell Biol, Div Cellular & Dev Biol, Berkeley, CA 94720 USAUniv Calif Berkeley, Dept Mol & Cell Biol, Div Cellular & Dev Biol, Berkeley, CA 94720 USA
Xue, Bin
He, Lin
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Berkeley, Dept Mol & Cell Biol, Div Cellular & Dev Biol, Berkeley, CA 94720 USAUniv Calif Berkeley, Dept Mol & Cell Biol, Div Cellular & Dev Biol, Berkeley, CA 94720 USA