ImtRDB: a database and software for mitochondrial imperfect interspersed repeats annotation

被引:7
|
作者
Shamanskiy, Viktor N. [1 ]
Timonina, Valeria N. [1 ]
Popadin, Konstantin Yu. [1 ,2 ,3 ]
Gunbin, Konstantin V. [1 ,4 ]
机构
[1] Immanuel Kant Balt Fed Univ, Sch Life Sci, Ctr Mitochondrial Funct Genom, Kaliningrad, Russia
[2] Univ Lausanne, Ctr Integrat Genom, Lausanne, Switzerland
[3] Swiss Inst Bioinformat, Lausanne, Switzerland
[4] RAS, SB, Inst Cytol & Genet, Ctr Brain Neurobiol & Neurogenet, Novosibirsk, Russia
基金
俄罗斯基础研究基金会;
关键词
mtDNA; Imperfect repeats; Database; Selection on dinucleotides; DE-NOVO IDENTIFICATION; TANDEM REPEATS; METABOLIC-RATE; DNA-SEQUENCES; TOOL; MICROSATELLITES; DOTPLOTS; FINDER; EFFICIENT; DELETIONS;
D O I
10.1186/s12864-019-5536-1
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
BackgroundMitochondria is a powerhouse of all eukaryotic cells that have its own circular DNA (mtDNA) encoding various RNAs and proteins. Somatic perturbations of mtDNA are accumulating with age thus it is of great importance to uncover the main sources of mtDNA instability. Recent analyses demonstrated that somatic mtDNA deletions depend on imperfect repeats of various nature between distant mtDNA segments. However, till now there are no comprehensive databases annotating all types of imperfect repeats in numerous species with sequenced complete mitochondrial genome as well as there are no algorithms capable to call all types of imperfect repeats in circular mtDNA.ResultsWe implemented naive algorithm of pattern recognition by analogy to standard dot-plot construction procedures allowing us to find both perfect and imperfect repeats of four main types: direct, inverted, mirror and complementary. Our algorithm is adapted to specific characteristics of mtDNA such as circularity and an excess of short repeats - it calls imperfect repeats starting from the length of 10b.p. We constructed interactive web available database ImtRDB depositing perfect and imperfect repeats positions in mtDNAs of more than 3500 Vertebrate species. Additional tools, such as visualization of repeats within a genome, comparison of repeat densities among different genomes and a possibility to download all results make this database useful for many biologists. Our first analyses of the database demonstrated that mtDNA imperfect repeats (i) are usually short; (ii) associated with unfolded DNA structures; (iii) four types of repeats positively correlate with each other forming two equivalent pairs: direct and mirror versus inverted and complementary, with identical nucleotide content and similar distribution between species; (iv) abundance of repeats is negatively associated with GC content; (v) dinucleotides GC versus CG are overrepresented on light chain of mtDNA covered by repeats.ConclusionsImtRDB is available at http://bioinfodbs.kantiana.ru/ImtRDB/. It is accompanied by the software calling all types of interspersed repeats with different level of degeneracy in circular DNA. This database and software can become a very useful tool in various areas of mitochondrial and chloroplast DNA research.
引用
收藏
页数:17
相关论文
共 19 条
  • [1] ImtRDB: a database and software for mitochondrial imperfect interspersed repeats annotation
    Viktor N. Shamanskiy
    Valeria N. Timonina
    Konstantin Yu. Popadin
    Konstantin V. Gunbin
    BMC Genomics, 20
  • [2] Correction to: ImtRDB: a database and software for mitochondrial imperfect interspersed repeats annotation
    Viktor A. Shamanskiy
    Valeria N. Timonina
    Konstantin Yu. Popadin
    Konstantin V. Gunbin
    BMC Genomics, 20
  • [3] mtRDB: a database and software for mitochondrial imperfect interspersed repeats annotation (vol 20, 295, 2019)
    Shamanskiy, Viktor A.
    Timonina, Valeria N.
    Popadin, Konstantin Yu.
    Gunbin, Konstantin V.
    BMC GENOMICS, 2019, 20 (1)
  • [4] EDIR: exome database of interspersed repeats
    Vo Ngoc, Laura D. T.
    Osei, Randy
    Dohr, Katrin
    Olsen, Catharina
    Seneca, Sara
    Gheldof, Alexander
    BIOINFORMATICS, 2023, 39 (01)
  • [5] Computation of a database of interspersed repeats in coding regions of the human genome
    Ngoc, Doan T. L. Vo
    Osei, Randy
    Dohr, Katrien
    Olsen, Catharina
    Stouffs, Katrien
    Sermon, Karen
    Seneca, Sara
    Gheldof, Alexander
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2022, 30 (SUPPL 1) : 491 - 491
  • [6] LOVD: variant annotation software and a public database
    Kroon, M.
    den Dunnen, J. T.
    Fokkema, I. F. A. C.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2018, 26 : 709 - 710
  • [7] 3′-UTR SIRF:: A database for identifying clusters of whort interspersed repeats in 3′ untranslated regions
    Andken, Benjamin B.
    Lim, In
    Benson, Gary
    Vincent, John J.
    Ferenc, Matthew T.
    Heinrich, Bianca
    Jarzylo, Larissa A.
    Man, Heng-Ye
    Deshler, James O.
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [8] 3'-UTR SIRF: A database for identifying clusters of short interspersed repeats in 3' untranslated regions
    Benjamin B Andken
    In Lim
    Gary Benson
    John J Vincent
    Matthew T Ferenc
    Bianca Heinrich
    Larissa A Jarzylo
    Heng-Ye Man
    James O Deshler
    BMC Bioinformatics, 8
  • [9] MitoProteome: mitochondrial protein sequence database and annotation system
    Cotter, D
    Guda, P
    Fahy, E
    Subramaniam, S
    NUCLEIC ACIDS RESEARCH, 2004, 32 : D463 - D467
  • [10] Gene annotation errors are common in the mammalian mitochondrial genomes database
    Prada, Carlos F.
    Boore, Jeffrey L.
    BMC GENOMICS, 2019, 20 (1)