Properties of non-coding DNA and identification of putative cis-regulatory elements in Theileria parva

被引:11
|
作者
Guo, Xiang [1 ,2 ]
Silva, Joana C. [1 ,3 ,4 ]
机构
[1] J Craig Venter Inst, Inst Genom Res, Rockville, MD 20850 USA
[2] SAIC Frederick Inc, NCI Frederick, Adv Biomed Comp Ctr, Frederick, MD 21702 USA
[3] Univ Maryland, Sch Med, Inst Genome Sci, Baltimore, MD 21201 USA
[4] Univ Maryland, Sch Med, Dept Microbiol & Immunol, Baltimore, MD 21201 USA
关键词
D O I
10.1186/1471-2164-9-582
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Parasites in the genus Theileria cause lymphoproliferative diseases in cattle, resulting in enormous socio-economic losses. The availability of the genome sequences and annotation for T. parva and T. annulata has facilitated the study of parasite biology and their relationship with host cell transformation and tropism. However, the mechanism of transcriptional regulation in this genus, which may be key to understanding fundamental aspects of its parasitology, remains poorly understood. In this study, we analyze the evolution of non-coding sequences in the Theileria genome and identify conserved sequence elements that may be involved in gene regulation of these parasitic species. Results: Intergenic regions and introns in Theileria are short, and their length distributions are considerably right-skewed. Intergenic regions flanked by genes in 5'-5' orientation tend to be longer and slightly more AT-rich than those flanked by two stop codons; intergenic regions flanked by genes in 3'-5' orientation have intermediate values of length and AT composition. Intron position is negatively correlated with intron length, and positively correlated with GC content. Using stringent criteria, we identified a set of high-quality orthologous non-coding sequences between T. parva and T. annulata, and determined the distribution of selective constraints across regions, which are shown to be higher close to translation start sites. A positive correlation between constraint and length in both intergenic regions and introns suggests a tight control over length expansion of non-coding regions. Genome-wide searches for functional elements revealed several conserved motifs in intergenic regions of Theileria genomes. Two such motifs are preferentially located within the first 60 base pairs upstream of transcription start sites in T. parva, are preferentially associated with specific protein functional categories, and have significant similarity to know regulatory motifs in other species. These results suggest that these two motifs are likely to represent transcription factor binding sites in Theileria. Conclusion: Theileria genomes are highly compact, with selection seemingly favoring short introns and intergenic regions. Three over-represented sequence motifs were independently identified in intergenic regions of both Theileria species, and the evidence suggests that at least two of them play a role in transcriptional control in T. parva. These are prime candidates for experimental validation of transcription factor binding sites in this single-celled eukaryotic parasite. Sequences similar to two of these Theileria motifs are conserved in Plasmodium hinting at the possibility of common regulatory machinery across the phylum Apicomplexa.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Identification of altered cis-regulatory elements in human disease
    Mathelier, Anthony
    Shi, Wenqiang
    Wasserman, Wyeth W.
    TRENDS IN GENETICS, 2015, 31 (02) : 67 - 76
  • [22] Analysis of putative cis-regulatory elements regulating blood pressure variation
    Nandakumar, Priyanka
    Lee, Dongwon
    Hoffmann, Thomas J.
    Ehret, Georg B.
    Arking, Dan
    Ranatunga, Dilrini
    Li, Man
    Grove, Megan L.
    Boerwinkle, Eric
    Schaefer, Catherine
    Kwok, Pui-Yan
    Iribarren, Carlos
    Risch, Neil
    Chakravarti, Aravinda
    HUMAN MOLECULAR GENETICS, 2020, 29 (11) : 1922 - 1932
  • [23] A survey of ancient conserved non-coding elements in the PAX6 locus reveals a landscape of interdigitated cis-regulatory archipelagos
    Bhatia, Shipra
    Monahan, Jack
    Ravi, Vydianathan
    Gautier, Philippe
    Murdoch, Emma
    Brenner, Sydney
    van Heyningen, Veronica
    Venkatesh, Byrappa
    Kleinjan, Dirk A.
    DEVELOPMENTAL BIOLOGY, 2014, 387 (02) : 214 - 228
  • [24] Mapping the Cis-Regulatory Architecture of the Human Retina Reveals Non-Coding Genetic Variation in Disease
    Cherry, Timothy
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2019, 60 (09)
  • [25] Non-coding variants impact cis-regulatory coordination in a cell type-specific manner
    Pushkarev, Olga
    van Mierlo, Guido
    Kribelbauer, Judith Franziska
    Saelens, Wouter
    Gardeux, Vincent
    Deplancke, Bart
    GENOME BIOLOGY, 2024, 25 (01):
  • [26] A Catalogue of Putative cis-Regulatory Interactions Between Long Non-coding RNAs and Proximal Coding Genes Based on Correlative Analysis Across Diverse Human Tumors
    Basu, Swaraj
    Larsson, Erik
    G3-GENES GENOMES GENETICS, 2018, 8 (06): : 2019 - 2025
  • [27] Identification of cis-regulatory elements in the epidermal differentiation complex (EDC)
    Strong, C. de Guzman
    Sears, K.
    Segre, J. A.
    JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2009, 129 : S90 - S90
  • [28] Identification of cis-Regulatory Elements in the dmyc Gene of Drosophila Melanogaster
    Kharazmi, Jasmine
    Moshfegh, Cameron
    Brody, Thomas
    GENE REGULATION AND SYSTEMS BIOLOGY, 2012, 6 : 15 - 42
  • [29] Identification and Conservation Analysis of Cis-Regulatory Elements in Pig Liver
    Luan, Yu
    Zhang, Lu
    Hu, Mingyang
    Xu, Yueyuan
    Hou, Ye
    Li, Xinyun
    Zhao, Shuhong
    Zhao, Yunxia
    Li, Changchun
    GENES, 2019, 10 (05):
  • [30] Identification of cis-Regulatory Elements in the Mammalian Genome: The cREMaG Database
    Piechota, Marcin
    Korostynski, Michal
    Przewlocki, Ryszard
    PLOS ONE, 2010, 5 (08):