Detection of tandem repeats in the Capsicum annuum genome

被引:5
|
作者
Rudenko, Valentina [1 ]
Korotkov, Eugene [1 ]
机构
[1] Russian Acad Sci, Inst Bioengn, Res Ctr Biotechnol, Moscow 119071, Russia
关键词
repeats; genetic algorithm; sequence; Capsicum annuum; DNA-SEQUENCE; PROVIDES INSIGHTS; PERIODICITY; EVOLUTION; SEARCH; IDENTIFICATION; CENTROMERE; REGIONS;
D O I
10.1093/dnares/dsad007
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
In this study, we modified the multiple alignment method based on the generation of random position weight matrices (RPWMs) and used it to search for tandem repeats (TRs) in the Capsicum annuum genome. The application of the modified (m)RPWM method, which considers the correlation of adjusting nucleotides, resulted in the identification of 908,072 TR regions with repeat lengths from 2 to 200 bp in the C. annuum genome, where they occupied similar to 29%. The most common TRs were 2 and 3 bp long followed by those of 21, 4, and 15 bp. We performed clustering analysis of TRs with repeat lengths of 2 and 21 bp and created position-weight matrices (PWMs) for each group; these templates could be used to search for TRs of a given length in any nucleotide sequence. All detected TRs can be accessed through publicly available database (http://victoria.biengi.ac.ru/capsicum_tr/). Comparison of mRPWM with other TR search methods such as Tandem Repeat Finder, T-REKS, and XSTREAM indicated that mRPWM could detect significantly more TRs at similar false discovery rates, indicating its superior performance. The developed mRPWM method can be successfully applied to the identification of highly divergent TRs, which is important for functional analysis of genomes and evolutionary studies.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Detection of Highly Divergent Tandem Repeats in the Rice Genome
    Korotkov, Eugene, V
    Kamionskya, Anastasiya M.
    Korotkova, Maria A.
    GENES, 2021, 12 (04)
  • [2] Genome (in)stability at tandem repeats
    Balzano, Elisa
    Pelliccia, Franca
    Giunta, Simona
    SEMINARS IN CELL & DEVELOPMENTAL BIOLOGY, 2021, 113 : 97 - 112
  • [3] Database of Potential Promoter Sequences in the Capsicum annuum Genome
    Rudenko, Valentina
    Korotkov, Eugene
    BIOLOGY-BASEL, 2022, 11 (08):
  • [4] Genome-wide detection of tandem DNA repeats that are expanded in autism
    Trost, Brett
    Engchuan, Worrawat
    Nguyen, Charlotte M.
    Thiruvahindrapuram, Bhooma
    Dolzhenko, Egor
    Backstrom, Ian
    Mirceta, Mila
    Mojarad, Bahareh A.
    Yin, Yue
    Dov, Alona
    Chandrakumar, Induja
    Prasolava, Tanya
    Shum, Natalie
    Hamdan, Omar
    Pellecchia, Giovanna
    Howe, Jennifer L.
    Whitney, Joseph
    Klee, Eric W.
    Baheti, Saurabh
    Amaral, David G.
    Anagnostou, Evdokia
    Elsabbagh, Mayada
    Fernandez, Bridget A.
    Ny Hoang
    Lewis, M. E. Suzanne
    Liu, Xudong
    Sjaarda, Calvin
    Smith, Isabel M.
    Szatmari, Peter
    Zwaigenbaum, Lonnie
    Glazer, David
    Hartley, Dean
    Stewart, A. Keith
    Eberle, Michael A.
    Sato, Nozomu
    Pearson, Christopher E.
    Scherer, Stephen W.
    Yuen, Ryan K. C.
    NATURE, 2020, 586 (7827) : 80 - +
  • [5] Genome-wide detection of tandem DNA repeats that are expanded in autism
    Brett Trost
    Worrawat Engchuan
    Charlotte M. Nguyen
    Bhooma Thiruvahindrapuram
    Egor Dolzhenko
    Ian Backstrom
    Mila Mirceta
    Bahareh A. Mojarad
    Yue Yin
    Alona Dov
    Induja Chandrakumar
    Tanya Prasolava
    Natalie Shum
    Omar Hamdan
    Giovanna Pellecchia
    Jennifer L. Howe
    Joseph Whitney
    Eric W. Klee
    Saurabh Baheti
    David G. Amaral
    Evdokia Anagnostou
    Mayada Elsabbagh
    Bridget A. Fernandez
    Ny Hoang
    M. E. Suzanne Lewis
    Xudong Liu
    Calvin Sjaarda
    Isabel M. Smith
    Peter Szatmari
    Lonnie Zwaigenbaum
    David Glazer
    Dean Hartley
    A. Keith Stewart
    Michael A. Eberle
    Nozomu Sato
    Christopher E. Pearson
    Stephen W. Scherer
    Ryan K. C. Yuen
    Nature, 2020, 586 : 80 - 86
  • [6] Genome-wide detection of somatic mosaicism at short tandem repeats
    Sehgal, Aarushi
    Jam, Helyaneh Ziaei
    Shen, Andrew
    Gymrek, Melissa
    BIOINFORMATICS, 2024, 40 (08)
  • [7] Complete genome sequencing and analysis of Capsicum annuum varieties
    Yul-Kyun Ahn
    Sandeep Karna
    Tae-Hwan Jun
    Eun-Young Yang
    Hye-Eun Lee
    Jin-Hee Kim
    Jeong-Ho Kim
    Molecular Breeding, 2016, 36
  • [8] Complete genome sequencing and analysis of Capsicum annuum varieties
    Ahn, Yul-Kyun
    Karna, Sandeep
    Jun, Tae-Hwan
    Yang, Eun-Young
    Lee, Hye-Eun
    Kim, Jin-Hee
    Kim, Jeong-Ho
    MOLECULAR BREEDING, 2016, 36 (10)
  • [9] Distribution of tandem repeats in human genome
    Fridman, M.
    Kulakovskiy, I.
    Lvovs, D.
    Oparina, N.
    Makeev, V.
    FEBS JOURNAL, 2013, 280 : 20 - 20
  • [10] Tandem repeats in the rodent genome and their mapping
    Ostromyshenskii D.I.
    Kuznetsova I.S.
    Komissarov A.S.
    Kartavtseva I.V.
    Podgornaya O.I.
    Cell and Tissue Biology, 2015, 9 (3) : 217 - 225