Gene Annotation and Transcriptome Delineation on a De Novo Genome Assembly for the Reference Leishmania major Friedlin Strain

被引:10
|
作者
Camacho, Esther [1 ]
Gonzalez-de la Fuente, Sandra [1 ]
Solana, Jose C. [1 ]
Rastrojo, Alberto [1 ]
Carrasco-Ramiro, Fernando [1 ]
Requena, Jose M. [1 ]
Aguado, Begona [1 ]
机构
[1] Univ Autonoma Madrid, Ctr Biol Mol Severo Ochoa CBMSO, CSIC UAM, Campus Excelencia Int CEI UAM CSIC, Madrid 28049, Spain
关键词
genome; transcriptome; gene models; Leishmania; Illumina sequencing; PacBio sequencing; expression levels; untranslated regions (UTR); SL-additions site (SAS); polyadenylation site (PAS); EXPRESSION; ORGANIZATION; PARASITE; SEQUENCE; IDENTIFICATION; VISUALIZATION; DONOVANI; FAMILY; STAGE;
D O I
10.3390/genes12091359
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Leishmania major is the main causative agent of cutaneous leishmaniasis in humans. The Friedlin strain of this species (LmjF) was chosen when a multi-laboratory consortium undertook the objective of deciphering the first genome sequence for a parasite of the genus Leishmania. The objective was successfully attained in 2005, and this represented a milestone for Leishmania molecular biology studies around the world. Although the LmjF genome sequence was done following a shotgun strategy and using classical Sanger sequencing, the results were excellent, and this genome assembly served as the reference for subsequent genome assemblies in other Leishmania species. Here, we present a new assembly for the genome of this strain (named LMJFC for clarity), generated by the combination of two high throughput sequencing platforms, Illumina short-read sequencing and PacBio Single Molecular Real-Time (SMRT) sequencing, which provides long-read sequences. Apart from resolving uncertain nucleotide positions, several genomic regions were reorganized and a more precise composition of tandemly repeated gene loci was attained. Additionally, the genome annotation was improved by adding 542 genes and more accurate coding-sequences defined for around two hundred genes, based on the transcriptome delimitation also carried out in this work. As a result, we are providing gene models (including untranslated regions and introns) for 11,238 genes. Genomic information ultimately determines the biology of every organism; therefore, our understanding of molecular mechanisms will depend on the availability of precise genome sequences and accurate gene annotations. In this regard, this work is providing an improved genome sequence and updated transcriptome annotations for the reference L. major Friedlin strain.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Resequencing and assembly of seven complex loci to improve the Leishmania major (Friedlin strain) reference genome
    Graciela Alonso
    Alberto Rastrojo
    Sara López-Pérez
    Jose M. Requena
    Begoña Aguado
    [J]. Parasites & Vectors, 9
  • [2] Resequencing and assembly of seven complex loci to improve the Leishmania major (Friedlin strain) reference genome
    Alonso, Graciela
    Rastrojo, Alberto
    Lopez-Perez, Sara
    Requena, Jose M.
    Aguado, Begona
    [J]. PARASITES & VECTORS, 2016, 9
  • [3] The complete chromosomal organization of the reference strain of the Leishmania genome project, L. major 'Friedlin'
    Ravel, C
    Dubessay, P
    Bastien, P
    [J]. PARASITOLOGY TODAY, 1998, 14 (08): : 301 - 303
  • [4] The Rhinella arenarum transcriptome: de novo assembly, annotation and gene prediction
    Danilo Guillermo Ceschin
    Natalia Susana Pires
    Mariana Noelia Mardirosian
    Cecilia Inés Lascano
    Andrés Venturino
    [J]. Scientific Reports, 10
  • [5] De novo transcriptome assembly and gene annotation for the toxic dinoflagellate Dinophysis
    Chetan C. Gaonkar
    Lisa Campbell
    [J]. Scientific Data, 10
  • [6] The Rhinella arenarum transcriptome: de novo assembly, annotation and gene prediction
    Guillermo Ceschin, Danilo
    Susana Pires, Natalia
    Noelia Mardirosian, Mariana
    Ines Lascano, Cecilia
    Venturino, Andres
    [J]. SCIENTIFIC REPORTS, 2020, 10 (01)
  • [7] De novo transcriptome assembly and gene annotation for the toxic dinoflagellate Dinophysis
    Gaonkar, Chetan C.
    Campbell, Lisa
    [J]. SCIENTIFIC DATA, 2023, 10 (01)
  • [8] Complete assembly of the Leishmania donovani (HU3 strain) genome and transcriptome annotation
    Camacho, Esther
    Gonzalez-de La Fuent, Sandra
    Rastrojo, Alberto
    Peiro-Pastor, Ramon
    Solana, Jose Carlos
    Tabera, Laura
    Gamarro, Francisco
    Carrasco-Ramiro, Fernando
    Requen, Jose M.
    Aguado, Begoiia
    [J]. SCIENTIFIC REPORTS, 2019, 9 (1)
  • [9] Complete assembly of the Leishmania donovani (HU3 strain) genome and transcriptome annotation
    Esther Camacho
    Sandra González-de la Fuente
    Alberto Rastrojo
    Ramón Peiró-Pastor
    Jose Carlos Solana
    Laura Tabera
    Francisco Gamarro
    Fernando Carrasco-Ramiro
    Jose M. Requena
    Begoña Aguado
    [J]. Scientific Reports, 9
  • [10] De novo transcriptome assembly and annotation for gene discovery in avocado, macadamia and mango
    Chabikwa, Tinashe G.
    Barbier, Francois F.
    Tanurdzic, Milos
    Beveridge, Christine A.
    [J]. SCIENTIFIC DATA, 2020, 7 (01)