The mosaic structure of human pericentromeric DNA: A strategy for characterizing complex regions of the human genome

Cited by: 86
Authors
Horvath, JE
Schwartz, S
Eichler, EE [1]
Affiliations
[1] Case Western Reserve Univ Hosp, Sch Med, Dept Genet, Cleveland, OH 44106 USA
[2] Case Western Reserve Univ Hosp, Sch Med, Ctr Human Genet, Cleveland, OH 44106 USA
DOI
10.1101/gr.10.6.839
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
The pericentromeric regions of human chromosomes pose particular problems for both mapping and sequencing. These difficulties are due, in large part, to the presence of duplicated genomic segments that are distributed among multiple human chromosomes. To ensure contiguity of genomic sequence in these regions, we designed a sequence-based strategy to characterize different pericentromeric regions using a single (162 kb) 2p11 seed sequence as a point of reference. Molecular and cytogenetic techniques were first used to construct a paralogy map that delineated the interchromosomal distribution of duplicated segments throughout the human genome. Monochromosomal hybrid DNAs were PCR amplified by primer pairs designed to the 2p11 reference sequence. The PCR products were directly sequenced and used to develop a catalog of sequence tags for each duplicon for each chromosome. A total of 685 paralogous sequence variants were generated by sequencing 34.7 kb of paralogous pericentromeric sequence. Using PCR products as hybridization probes, we were able to identify 702 human BAC clones, of which a subset, 107 clones, were analyzed at the sequence level. We used diagnostic paralogous sequence variants to assign 65 of these BACs to at least 9 chromosomal pericentromeric regions: 1q12, 2p11, 9p11/q12, 10p11, 14q11, 15q11, 16p11, 17p11, and 22q11. Comparisons with existing sequence and physical maps for the human genome suggest that many of these BACs map to regions of the genome with sequence gaps. Our analysis indicates that large portions of pericentromeric DNA are virtually devoid of unique sequences. Instead, they consist of a mosaic of different genomic segments that have had different propensities for duplication. These biologic properties may be exploited for the rapid characterization not only of pericentromeric DNA but also of other complex paralogous regions of the human genome.
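The assignment step described in the abstract (using diagnostic paralogous sequence variants to place BAC clones on chromosomes) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function name `assign_clone`, the `min_support` threshold, the positions, and the bases in the toy catalog are all hypothetical, and the sketch assumes a variant counts as diagnostic only when its base is unique to one chromosomal region at that position.

```python
from collections import Counter

def assign_clone(clone_alleles, psv_catalog, min_support=2):
    """Assign a clone to the region whose diagnostic variants it matches.

    clone_alleles: {position: base} observed in the clone (hypothetical input).
    psv_catalog:   {region: {position: base}} catalog of paralogous variants.
    Returns the best-supported region, or None if support is weak or tied.
    """
    support = Counter()
    for region, variants in psv_catalog.items():
        for pos, base in variants.items():
            if clone_alleles.get(pos) != base:
                continue
            # Assumption: a variant is diagnostic only if no other region
            # carries the same base at this position.
            others = [v.get(pos) for r, v in psv_catalog.items() if r != region]
            if base not in others:
                support[region] += 1
    if not support:
        return None
    ranked = support.most_common(2)
    best_region, best_count = ranked[0]
    if best_count < min_support:
        return None  # too few diagnostic matches
    if len(ranked) > 1 and ranked[1][1] == best_count:
        return None  # tie between regions: leave unassigned
    return best_region

# Toy catalog; positions and bases are invented for illustration only.
catalog = {
    "2p11":  {101: "A", 250: "G", 377: "T", 512: "C"},
    "10p11": {101: "C", 250: "G", 377: "C", 512: "T"},
    "22q11": {101: "A", 250: "T", 377: "C", 512: "T"},
}
clone = {101: "A", 250: "G", 377: "T", 512: "C"}
assigned = assign_clone(clone, catalog)
```

Requiring a minimum number of diagnostic matches and refusing ties is one conservative design choice; it mirrors the abstract's observation that only a subset of the sequenced clones could be assigned to a region.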
Pages: 839-852
Page count: 14