Discovering novel reproductive genes in a non-model fly using de novo GridION transcriptomics

被引:1
|
作者
Walter, Mrinalini [1 ]
Puniamoorthy, Nalini [1 ]
机构
[1] Natl Univ Singapore, Dept Biol Sci, Singapore, Singapore
关键词
gene expression; GridION; Illumina; novel gene; Oxford Nanopore Technologies (ONT); reproduction; Sepsis punctum; sexual selection; SEXUAL SIZE DIMORPHISM; ACCESSORY-GLAND PROTEINS; SEMINAL FLUID PROTEINS; DROSOPHILA-MELANOGASTER; POSITIVE SELECTION; RAPID EVOLUTION; MOLECULAR CHARACTERIZATION; DUNG FLY; CD-HIT; SPERM;
D O I
10.3389/fgene.2022.1003771
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Gene discovery has important implications for investigating phenotypic trait evolution, adaptation, and speciation. Male reproductive tissues, such as accessory glands (AGs), are hotspots for recruitment of novel genes that diverge rapidly even among closely related species/populations. These genes synthesize seminal fluid proteins that often affect post-copulatory sexual selection-they can mediate male-male sperm competition, ejaculate-female interactions that modify female remating and even influence reproductive incompatibilities among diverging species/populations. Although de novo transcriptomics has facilitated gene discovery in non-model organisms, reproductive gene discovery is still challenging without a reference database as they are often novel and bear no homology to known proteins. Here, we use reference-free GridION long-read transcriptomics, from Oxford Nanopore Technologies (ONT), to discover novel AG genes and characterize their expression in the widespread dung fly, Sepsis punctum. Despite stark population differences in male reproductive traits (e.g.: Body size, testes size, and sperm length) as well as female re-mating, the male AG genes and their secretions of S. punctum are still unknown. We implement a de novo ONT transcriptome pipeline incorporating quality-filtering and rigorous error-correction procedures, and we evaluate gene sequence and gene expression results against high-quality Illumina short-read data. We discover highly-expressed reproductive genes in AG transcriptomes of S. punctum consisting of 40 high-quality and high-confidence ONT genes that cross-verify against Illumina genes, among which 26 are novel and specific to S. punctum. Novel genes account for an average of 81% of total gene expression and may be functionally relevant in seminal fluid protein production. For instance, 80% of genes encoding secretory proteins account for 74% total gene expression. In addition, median sequence similarities of ONT nucleotide and protein sequences match within-Illumina sequence similarities. Read-count based expression quantification in ONT is congruent with Illumina's Transcript per Million (TPM), both in overall pattern and within functional categories. Rapid genomic innovation followed by recruitment of de novo genes for high expression in S. punctum AG tissue, a pattern observed in other insects, could be a likely mechanism of evolution of these genes. The study also demonstrates the feasibility of adapting ONT transcriptomics for gene discovery in non-model systems.
引用
下载
收藏
页数:16
相关论文
共 50 条
  • [1] Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved (Non-model) Organisms
    Joyce, Blake L.
    Haug-Baltzell, Asher K.
    Hulvey, Jonathan P.
    McCarthy, Fiona
    Devisetty, Upendra Kumar
    Lyons, Eric
    JOVE-JOURNAL OF VISUALIZED EXPERIMENTS, 2017, (123):
  • [2] De novo transcriptome sequencing of a non-model polychaete species
    Cannarsa, E.
    Zampicinini, G.
    Friard, O.
    Santovito, A.
    Cervella, P.
    MARINE GENOMICS, 2016, 29 : 31 - 34
  • [3] Sequencing smart: De novo sequencing and assembly approaches for a non-model mammal
    Etherington, Graham J.
    Heavens, Darren
    Baker, David
    Lister, Ashleigh
    McNelly, Rose
    Garcia, Gonzalo
    Clavijo, Bernardo
    Macaulay, Iain
    Haerty, Wilfried
    Di Palma, Federica
    GIGASCIENCE, 2020, 9 (05):
  • [4] TransFlow: a modular framework for assembling and assessing accurate de novo transcriptomes in non-model organisms
    Pedro Seoane
    Marina Espigares
    Rosario Carmona
    Álvaro Polonio
    Julia Quintana
    Enrico Cretazzo
    Josefina Bota
    Alejandro Pérez-García
    Juan de Dios Alché
    Luis Gómez
    M. Gonzalo Claros
    BMC Bioinformatics, 19
  • [5] A scalable and memory-efficient algorithm for de novo transcriptome assembly of non-model organisms
    Sze, Sing-Hoi
    Pimsler, Meaghan L.
    Tomberlin, Jeffery K.
    Jones, Corbin D.
    Tarone, Aaron M.
    BMC GENOMICS, 2017, 18
  • [6] A scalable and memory-efficient algorithm for de novo transcriptome assembly of non-model organisms
    Sing-Hoi Sze
    Meaghan L. Pimsler
    Jeffery K. Tomberlin
    Corbin D. Jones
    Aaron M. Tarone
    BMC Genomics, 18
  • [7] Assembly and annotation of a non-model gastropod (Nerita melanotragus) transcriptome: A comparison of de novo assemblers
    Amin S.
    Prentis P.J.
    Gilding E.K.
    Pavasovic A.
    BMC Research Notes, 7 (1)
  • [8] TransFlow: a modular framework for assembling and assessing accurate de novo transcriptomes in non-model organisms
    Seoane, Pedro
    Espigares, Marina
    Carmona, Rosario
    Polonio, Alvaro
    Quintana, Julia
    Cretazzo, Enrico
    Bota, Josefina
    Perez-Garcia, Alejandro
    de Dios Alche, Juan
    Gomez, Luis
    Gonzalo Claros, M.
    BMC BIOINFORMATICS, 2018, 19
  • [9] Obtaining the Most Accurate de novo Transcriptomes for Non-model Organisms: The Case of Castanea sativa
    Espigares, Marina
    Seoane, Pedro
    Bautista, Rocio
    Quintana, Julia
    Gomez, Luis
    Gonzalo Claros, M.
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING, IWBBIO 2017, PT II, 2017, 10209 : 489 - 499
  • [10] Double Digest RADseq: An Inexpensive Method for De Novo SNP Discovery and Genotyping in Model and Non-Model Species
    Peterson, Brant K.
    Weber, Jesse N.
    Kay, Emily H.
    Fisher, Heidi S.
    Hoekstra, Hopi E.
    PLOS ONE, 2012, 7 (05):