Properties of non-coding DNA and identification of putative cis-regulatory elements in Theileria parva

被引:11
|
作者
Guo, Xiang [1 ,2 ]
Silva, Joana C. [1 ,3 ,4 ]
机构
[1] J Craig Venter Inst, Inst Genom Res, Rockville, MD 20850 USA
[2] SAIC Frederick Inc, NCI Frederick, Adv Biomed Comp Ctr, Frederick, MD 21702 USA
[3] Univ Maryland, Sch Med, Inst Genome Sci, Baltimore, MD 21201 USA
[4] Univ Maryland, Sch Med, Dept Microbiol & Immunol, Baltimore, MD 21201 USA
关键词
D O I
10.1186/1471-2164-9-582
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Parasites in the genus Theileria cause lymphoproliferative diseases in cattle, resulting in enormous socio-economic losses. The availability of the genome sequences and annotation for T. parva and T. annulata has facilitated the study of parasite biology and their relationship with host cell transformation and tropism. However, the mechanism of transcriptional regulation in this genus, which may be key to understanding fundamental aspects of its parasitology, remains poorly understood. In this study, we analyze the evolution of non-coding sequences in the Theileria genome and identify conserved sequence elements that may be involved in gene regulation of these parasitic species. Results: Intergenic regions and introns in Theileria are short, and their length distributions are considerably right-skewed. Intergenic regions flanked by genes in 5'-5' orientation tend to be longer and slightly more AT-rich than those flanked by two stop codons; intergenic regions flanked by genes in 3'-5' orientation have intermediate values of length and AT composition. Intron position is negatively correlated with intron length, and positively correlated with GC content. Using stringent criteria, we identified a set of high-quality orthologous non-coding sequences between T. parva and T. annulata, and determined the distribution of selective constraints across regions, which are shown to be higher close to translation start sites. A positive correlation between constraint and length in both intergenic regions and introns suggests a tight control over length expansion of non-coding regions. Genome-wide searches for functional elements revealed several conserved motifs in intergenic regions of Theileria genomes. Two such motifs are preferentially located within the first 60 base pairs upstream of transcription start sites in T. parva, are preferentially associated with specific protein functional categories, and have significant similarity to know regulatory motifs in other species. These results suggest that these two motifs are likely to represent transcription factor binding sites in Theileria. Conclusion: Theileria genomes are highly compact, with selection seemingly favoring short introns and intergenic regions. Three over-represented sequence motifs were independently identified in intergenic regions of both Theileria species, and the evidence suggests that at least two of them play a role in transcriptional control in T. parva. These are prime candidates for experimental validation of transcription factor binding sites in this single-celled eukaryotic parasite. Sequences similar to two of these Theileria motifs are conserved in Plasmodium hinting at the possibility of common regulatory machinery across the phylum Apicomplexa.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Analysis of mammalian cis-regulatory DNA elements by homologous recombination
    Fiering, S
    Bender, MA
    Groudine, M
    EXPRESSION OF RECOMBINANT GENES IN EUKARYOTIC SYSTEMS, 1999, 306 : 42 - 66
  • [32] Identification and analysis of cis-regulatory elements in AGL24
    Kaneko, M
    Takemura, M
    Kohchi, T
    PLANT AND CELL PHYSIOLOGY, 2005, 46 : S131 - S131
  • [33] Identification of cis-regulatory elements for MECP2 expression
    Liu, Jinglan
    Francke, Uta
    HUMAN MOLECULAR GENETICS, 2006, 15 (11) : 1769 - 1782
  • [34] Systems-wide Identification of cis-Regulatory Elements in Proteins
    Yeon, Ju Hun
    Heinkel, Florian
    Sung, Minhui
    Na, Dokyun
    Gsponer, Jorg
    CELL SYSTEMS, 2016, 2 (02) : 89 - 100
  • [35] Epigenomic identification of vernalization cis-regulatory elements in winter wheat
    Liu, Yanhong
    Liu, Pan
    Gao, Lifeng
    Li, Yushan
    Ren, Xueni
    Jia, Jizeng
    Wang, Lei
    Zheng, Xu
    Tong, Yiping
    Pei, Hongcui
    Lu, Zefu
    GENOME BIOLOGY, 2024, 25 (01):
  • [36] Putative cis-regulatory elements in genes highly expressed in rice sperm cells
    Sharma N.
    Russell S.D.
    Bhalla P.L.
    Singh M.B.
    BMC Research Notes, 4 (1)
  • [37] Core genes of biomineralization and cis-regulatory long non-coding RNA regulate shell growth in bivalves
    Peng, Maoxiao
    Cardoso, Joao C. R.
    Pearson, Gareth
    Canario, Adelino V. M.
    Power, Deborah M.
    JOURNAL OF ADVANCED RESEARCH, 2024, 64 : 117 - 129
  • [38] Combined population transcriptomic and genomic analysis reveals cis-regulatory differentiation of non-coding RNAs in maize
    Lu, Jiawen
    Zhen, Sihan
    Zhang, Jie
    Xie, Yuxin
    He, Cheng
    Wang, Xiaoli
    Wang, Zheyuan
    Zhang, Song
    Li, Yongxiang
    Cui, Yu
    Wang, Guoying
    Wang, Jianhua
    Liu, Jun
    Li, Lin
    Gu, Riliang
    Zheng, Xiaoming
    Fu, Junjie
    THEORETICAL AND APPLIED GENETICS, 2023, 136 (01) : 1 - 13
  • [39] Combined population transcriptomic and genomic analysis reveals cis-regulatory differentiation of non-coding RNAs in maize
    Jiawen Lu
    Sihan Zhen
    Jie Zhang
    Yuxin Xie
    Cheng He
    Xiaoli Wang
    Zheyuan Wang
    Song Zhang
    Yongxiang Li
    Yu Cui
    Guoying Wang
    Jianhua Wang
    Jun Liu
    Lin Li
    Riliang Gu
    Xiaoming Zheng
    Junjie Fu
    Theoretical and Applied Genetics, 2023, 136
  • [40] Identification of functional cis-regulatory elements by sequential enrichment from a randomized synthetic DNA library
    Mario Roccaro
    Nahal Ahmadinejad
    Thomas Colby
    Imre E Somssich
    BMC Plant Biology, 13