Properties of non-coding DNA and identification of putative cis-regulatory elements in Theileria parva

被引:10
|
作者
Guo, Xiang [1 ,2 ]
Silva, Joana C. [1 ,3 ,4 ]
机构
[1] J Craig Venter Inst, Inst Genom Res, Rockville, MD 20850 USA
[2] SAIC Frederick Inc, NCI Frederick, Adv Biomed Comp Ctr, Frederick, MD 21702 USA
[3] Univ Maryland, Sch Med, Inst Genome Sci, Baltimore, MD 21201 USA
[4] Univ Maryland, Sch Med, Dept Microbiol & Immunol, Baltimore, MD 21201 USA
关键词
D O I
10.1186/1471-2164-9-582
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Parasites in the genus Theileria cause lymphoproliferative diseases in cattle, resulting in enormous socio-economic losses. The availability of the genome sequences and annotation for T. parva and T. annulata has facilitated the study of parasite biology and their relationship with host cell transformation and tropism. However, the mechanism of transcriptional regulation in this genus, which may be key to understanding fundamental aspects of its parasitology, remains poorly understood. In this study, we analyze the evolution of non-coding sequences in the Theileria genome and identify conserved sequence elements that may be involved in gene regulation of these parasitic species. Results: Intergenic regions and introns in Theileria are short, and their length distributions are considerably right-skewed. Intergenic regions flanked by genes in 5'-5' orientation tend to be longer and slightly more AT-rich than those flanked by two stop codons; intergenic regions flanked by genes in 3'-5' orientation have intermediate values of length and AT composition. Intron position is negatively correlated with intron length, and positively correlated with GC content. Using stringent criteria, we identified a set of high-quality orthologous non-coding sequences between T. parva and T. annulata, and determined the distribution of selective constraints across regions, which are shown to be higher close to translation start sites. A positive correlation between constraint and length in both intergenic regions and introns suggests a tight control over length expansion of non-coding regions. Genome-wide searches for functional elements revealed several conserved motifs in intergenic regions of Theileria genomes. Two such motifs are preferentially located within the first 60 base pairs upstream of transcription start sites in T. parva, are preferentially associated with specific protein functional categories, and have significant similarity to know regulatory motifs in other species. These results suggest that these two motifs are likely to represent transcription factor binding sites in Theileria. Conclusion: Theileria genomes are highly compact, with selection seemingly favoring short introns and intergenic regions. Three over-represented sequence motifs were independently identified in intergenic regions of both Theileria species, and the evidence suggests that at least two of them play a role in transcriptional control in T. parva. These are prime candidates for experimental validation of transcription factor binding sites in this single-celled eukaryotic parasite. Sequences similar to two of these Theileria motifs are conserved in Plasmodium hinting at the possibility of common regulatory machinery across the phylum Apicomplexa.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Properties of non-coding DNA and identification of putative cis-regulatory elements in Theileria parva
    Xiang Guo
    Joana C Silva
    [J]. BMC Genomics, 9
  • [2] Identification of clade-wide putative cis-regulatory elements from conserved non-coding sequences in Cucurbitaceae genomes
    Song, Hongtao
    Wang, Qi
    Zhang, Zhonghua
    Lin, Kui
    Pang, Erli
    [J]. HORTICULTURE RESEARCH, 2023, 10 (04)
  • [3] Non-coding transcription at cis-regulatory elements: Computational and experimental approaches
    Simonatto, Marta
    Barozzi, Iros
    Natoli, Gioacchino
    [J]. METHODS, 2013, 63 (01) : 66 - 75
  • [4] Machine Learning Prediction of Non-Coding Variant Impact in Human Retinal cis-Regulatory Elements
    VandenBosch, Leah S.
    Luu, Kelsey
    Timms, Andrew E.
    Challam, Shriya
    Wu, Yue
    Lee, Aaron Y.
    Cherry, Timothy J.
    [J]. TRANSLATIONAL VISION SCIENCE & TECHNOLOGY, 2022, 11 (04):
  • [5] Profiling of conserved non-coding elements upstream of SHOX and functional characterisation of the SHOX cis-regulatory landscape
    Hannah Verdin
    Ana Fernández-Miñán
    Sara Benito-Sanz
    Sandra Janssens
    Bert Callewaert
    Kathleen De Waele
    Jean De Schepper
    Inge François
    Björn Menten
    Karen E. Heath
    José Luis Gómez-Skarmeta
    Elfride De Baere
    [J]. Scientific Reports, 5
  • [6] Profiling of conserved non-coding elements upstream of SHOX and functional characterisation of the SHOX cis-regulatory landscape
    Verdin, Hannah
    Fernandez-Minan, Ana
    Benito-Sanz, Sara
    Janssens, Sandra
    Callewaert, Bert
    De Waele, Kathleen
    De Schepper, Jean
    Francois, Inge
    Menten, Bjeorn
    Heath, Karen E.
    Gomez-Skarmeta, Jose Luis
    De Baere, Elfride
    [J]. SCIENTIFIC REPORTS, 2015, 5
  • [7] cis-Regulatory Complexity within a Large Non-Coding Region in the Drosophila Genome
    Kundu, Mukta
    Kuzin, Alexander
    Lin, Tzu-Yang
    Lee, Chi-Hon
    Brody, Thomas
    Odenwald, Ward F.
    [J]. PLOS ONE, 2013, 8 (04):
  • [8] Identification of putative cis-regulatory elements in Cryptosporidium parvum by de novo pattern finding
    Nandita Mullapudi
    Cheryl A Lancto
    Mitchell S Abrahamsen
    Jessica C Kissinger
    [J]. BMC Genomics, 8
  • [9] Identification of cis-regulatory elements by chromatin structure
    Lu, Zefu
    Ricci, William A.
    Schmitz, Robert J.
    Zhang, Xiaoyu
    [J]. CURRENT OPINION IN PLANT BIOLOGY, 2018, 42 : 90 - 94
  • [10] Identification of putative cis-regulatory elements in Cryptosporidium parvum by de novo pattern finding
    Mullapudi, Nandita
    Lancto, Cheryl A.
    Abrahamsen, Mitchell S.
    Kissinger, Jessica C.
    [J]. BMC GENOMICS, 2007, 8 (1)