Large-Scale Detection of Telomeric Motif Sequences in Genomic Data Using TelFinder

被引:7
|
作者
Sun, Qing [1 ,2 ,3 ]
Wang, Hao [1 ,2 ,3 ]
Tao, Shiheng [1 ,2 ,3 ]
Xi, Xuguang [1 ,4 ]
机构
[1] Northwest A&F Univ, Coll Life Sci, Yangling, Shaanxi, Peoples R China
[2] Northwest A&F Univ, Bioinformat Ctr, Yangling, Shaanxi, Peoples R China
[3] Northwest A&F Univ, State Key Lab Crop Stress Biol Arid Areas, Yangling, Shaanxi, Peoples R China
[4] Univ Paris Saclay, CNRS, LBPA, ENS Paris Saclay, Gif Sur Yvette, France
来源
MICROBIOLOGY SPECTRUM | 2023年 / 11卷 / 02期
基金
中国国家自然科学基金;
关键词
detect telomeric motif sequence; fungi; variation in telomere; VERTEBRATE; REPEATS; IDENTIFICATION; CHROMOSOME; TER1;
D O I
10.1128/spectrum.03928-22
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Telomeres are regions of tandem repeated sequences at the ends of linear chromosomes that protect against DNA damage and chromosome fusion. Telomeres are associated with senescence and cancers and have attracted the attention of an increasing number of researchers. However, few telomeric motif sequences are known. Given the mounting interest in telomeres, an efficient computational tool for the de novo detection of the telomeric motif sequence of new species is needed since experimental-based methods are costly in terms of time and effort. Here, we report the development of TelFinder, an easy-to-use and freely available tool for the de novo detection of telomeric motif sequences from genomic data. The vast quantity of readily available genomic data makes it possible to apply this tool to any species of interest, which will undoubtedly inspire studies requiring telomeric repeat information and improve the utilization of these genomic data sets. We have tested TelFinder on telomeric sequences available in the Telomerase Database, and the detection accuracy reaches 90%. In addition, variation analyses in telomere sequences can be performed by TelFinder for the first time. The telomere variation preference of different chromosomes and even at the ends of the chromosome can provide clues regarding the underlying mechanisms of telomeres. Overall, these results shed new light on the divergent evolution of telomeres. IMPORTANCE Telomeres are reported to be highly correlated with the cell cycle and aging. As a result, research on telomere composition and evolution has become more and more urgent. However, using experimental methods to detect telomeric motif sequences is slow and costly. To combat this challenge, we developed TelFinder, a computational tool for the de novo detection of the telomere composition only using genomic data. In this study, we showed that a lot of complicated telomeric motifs could be identified by TelFinder only using genomic data. In addition, TelFinder can be used to check variation analyses in telomere sequences, which could lead to a deeper understanding of telomere sequences.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Large-Scale Detection of Telomeric Motif Sequences in Genomic Data Using TelFinder (vol 11, e03928-22, 2023)
    Sun, Qing
    Wang, Hao
    Tao, Shiheng
    Xi, Xuguang
    [J]. MICROBIOLOGY SPECTRUM, 2023, 11 (06):
  • [2] Visualizing large-scale genomic sequences
    Glusman, G
    Lancet, D
    [J]. IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE, 2001, 20 (04): : 49 - 54
  • [3] Automated gene identification in large-scale genomic sequences
    Xu, Y
    Uberbacher, EC
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 1997, 4 (03) : 325 - 338
  • [4] Large-scale detection of recombination in nucleotide sequences
    Chan, Cheong Xin
    Beiko, Robert G.
    Ragan, Mark A.
    [J]. INTERNATIONAL CONFERENCE ON MATHEMATICAL BIOLOGY 2007, 2008, 971 : 101 - +
  • [5] A large-scale comparison of genomic sequences: One promising approach
    Kirzhner, V
    Nevo, E
    Korol, A
    Bolshoy, A
    [J]. ACTA BIOTHEORETICA, 2003, 51 (02) : 73 - 89
  • [6] A Large-Scale Comparison of Genomic Sequences: One Promising Approach
    Valery Kirzhner
    Eviatar Nevo
    Abraham Korol
    Alexander Bolshoy
    [J]. Acta Biotheoretica, 2003, 51 : 73 - 89
  • [7] Large-scale network motif analysis using compression
    Peter Bloem
    Steven de Rooij
    [J]. Data Mining and Knowledge Discovery, 2020, 34 : 1421 - 1453
  • [8] Large-scale network motif analysis using compression
    Bloem, Peter
    de Rooij, Steven
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 34 (05) : 1421 - 1453
  • [9] Analyzing large-scale genomic data with cloud data lakes
    Weintraub, Grisha
    Hadar, Noam
    Gudes, Ehud
    Dolev, Shlomi
    Birk, Ohad S.
    [J]. PROCEEDINGS OF THE 16TH ACM INTERNATIONAL SYSTEMS AND STORAGE CONFERENCE, SYSTOR 2023, 2023, : 142 - 142
  • [10] Identifying Recent Adaptations in Large-Scale Genomic Data
    Grossman, Sharon R.
    Andersen, Kristian G.
    Shlyakhter, Ilya
    Tabrizi, Shervin
    Winnicki, Sarah
    Yen, Angela
    Park, Daniel J.
    Griesemer, Dustin
    Karlsson, Elinor K.
    Wong, Sunny H.
    Cabili, Moran
    Adegbola, Richard A.
    Bamezai, Rameshwar N. K.
    Hill, Adrian V. S.
    Vannberg, Fredrik O.
    Rinn, John L.
    Lander, Eric S.
    Schaffner, Stephen F.
    Sabeti, Pardis C.
    [J]. CELL, 2013, 152 (04) : 703 - 713