Bias and Correction in RNA-seq Data for Marine Species

被引:0
|
作者
Kai Song
Li Li
Guofan Zhang
机构
[1] Chinese Academy of Sciences,Key Laboratory of Experimental Marine Biology, Institute of Oceanology
[2] National & Local Joint Engineering Laboratory of Ecological Mariculture,Laboratory for Marine Fisheries and Aquaculture
[3] Qingdao National Laboratory for Marine Science and Technology,Laboratory for Marine Biology and Biotechnology
[4] Qingdao National Laboratory for Marine Science and Technology,undefined
来源
Marine Biotechnology | 2017年 / 19卷
关键词
Transcriptome profiling; RNA-seq analysis bias; Gene expression; GC content;
D O I
暂无
中图分类号
学科分类号
摘要
RNA-seq is a recently developed approach widely used for transcriptome profiling in biological analyses that use next-generation sequencing technologies. Accurate estimation of gene expression levels is critical for answering biological questions. Here, we show that the commonly used measure of gene expression levels, fragments per kilobase of transcript per million mapped reads (FPKM), is biased in transcript length, GC content, and dinucleotide frequencies in the RNA-seq analysis of marine species. We used a generalized linear model to correct the observed biases of FPKM. We used RNA-seq data sets from eight species obtained by different sequencing methods to evaluate the correction methods. Our work contributes to the understanding of potential technical artifacts in RNA-seq experiments for marine species, and presents a means by which more accurate gene expression measures can be obtained.
引用
收藏
页码:541 / 550
页数:9
相关论文
共 50 条
  • [41] An integrative method to normalize RNA-Seq data
    Filloux, Cyril
    Cedric, Meersseman
    Romain, Philippe
    Lionel, Forestier
    Christophe, Klopp
    Dominique, Rocha
    Abderrahman, Maftah
    Daniel, Petit
    BMC BIOINFORMATICS, 2014, 15
  • [42] Mining RNA-Seq Data for Infections and Contaminations
    Bonfert, Thomas
    Csaba, Gergely
    Zimmer, Ralf
    Friedel, Caroline C.
    PLOS ONE, 2013, 8 (09):
  • [43] RNAseqViewer: visualization tool for RNA-Seq data
    Roge, Xavier
    Zhang, Xuegong
    BIOINFORMATICS, 2014, 30 (06) : 891 - 892
  • [44] Pathogen detection in RNA-seq data with Pathonoia
    Anna-Maria Liebhoff
    Kevin Menden
    Alena Laschtowitz
    Andre Franke
    Christoph Schramm
    Stefan Bonn
    BMC Bioinformatics, 24
  • [45] RNA-Seq data: a goldmine for organelle research
    Smith, David Roy
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2013, 12 (05) : 454 - 456
  • [46] An Integrated Approach for RNA-seq Data Normalization
    Yang, Shengping
    Mercante, Donald E.
    Zhang, Kun
    Fang, Zhide
    CANCER INFORMATICS, 2016, 15 : 129 - 141
  • [47] Statistical analysis of RNA-seq data at scale
    Leek, Jeff T.
    GENETIC EPIDEMIOLOGY, 2015, 39 (07) : 563 - 563
  • [48] Randomized Adapters for Reducing Bias in Small RNA-Seq Libraries
    不详
    BIOTECHNIQUES, 2014, 56 (02) : 95 - 96
  • [49] Publisher Correction: Challenges in unsupervised clustering of single-cell RNA-seq data
    Vladimir Yu Kiselev
    Tallulah S. Andrews
    Martin Hemberg
    Nature Reviews Genetics, 2019, 20 : 310 - 310
  • [50] Evaluating the impact of sequencing error correction for RNA-seq data with ERCC RNA spike-in controls
    Tong, Li
    Yang, Cheng
    Wu, Po-Yen
    Wang, May D.
    2016 3RD IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS, 2016, : 74 - 77