Bias and Correction in RNA-seq Data for Marine Species

Authors
Kai Song
Li Li
Guofan Zhang
Affiliations
[1] Chinese Academy of Sciences, Key Laboratory of Experimental Marine Biology, Institute of Oceanology
[2] National & Local Joint Engineering Laboratory of Ecological Mariculture, Laboratory for Marine Fisheries and Aquaculture
[3] Qingdao National Laboratory for Marine Science and Technology, Laboratory for Marine Biology and Biotechnology
[4] Qingdao National Laboratory for Marine Science and Technology
Source
Marine Biotechnology | 2017 / Vol. 19, No. 5
Keywords
Transcriptome profiling; RNA-seq analysis bias; Gene expression; GC content
DOI
Not available
Abstract
RNA-seq, a recently developed approach based on next-generation sequencing technologies, is widely used for transcriptome profiling in biological analyses. Accurate estimation of gene expression levels is critical for answering biological questions. Here, we show that the commonly used measure of gene expression levels, fragments per kilobase of transcript per million mapped reads (FPKM), is biased by transcript length, GC content, and dinucleotide frequencies in the RNA-seq analysis of marine species. We used a generalized linear model to correct the observed biases of FPKM, and evaluated the correction methods on RNA-seq data sets from eight species obtained by different sequencing methods. Our work contributes to the understanding of potential technical artifacts in RNA-seq experiments for marine species, and presents a means by which more accurate gene expression measures can be obtained.
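As a rough illustration of the quantities named in the abstract, the sketch below computes FPKM from its standard definition and then removes a linear trend against a single covariate such as GC content. The abstract does not specify the authors' generalized linear model, so the one-covariate least-squares fit here is only a simplified stand-in, and the function names `fpkm` and `remove_linear_trend` are hypothetical.

```python
def fpkm(fragments, length_bp, total_mapped_fragments):
    """Fragments per kilobase of transcript per million mapped fragments.

    Standard definition: fragments / (length in kb) / (total mapped in millions).
    """
    return fragments * 1e9 / (length_bp * total_mapped_fragments)


def remove_linear_trend(values, covariate):
    """Subtract the least-squares linear trend of `values` on `covariate`.

    Assumption: a one-covariate linear fit used as a simplified stand-in,
    not the generalized linear model used in the paper. The result keeps
    the original mean so corrected values stay on the same scale.
    """
    n = len(values)
    mean_y = sum(values) / n
    mean_x = sum(covariate) / n
    sxx = sum((x - mean_x) ** 2 for x in covariate)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(covariate, values))
    slope = sxy / sxx
    return [y - slope * (x - mean_x) for x, y in zip(covariate, values)]


# Toy data (made up for illustration): log-scale expression values that
# depend linearly on a covariate, for example GC content.
toy_log_expression = [2.0, 5.0, 8.0, 11.0]
toy_covariate = [0.0, 1.0, 2.0, 3.0]
corrected = remove_linear_trend(toy_log_expression, toy_covariate)
```

In practice such a fit would typically be applied to log-transformed FPKM values and would include several covariates at once (transcript length, GC content, dinucleotide frequencies), which needs a multivariate model rather than this single-covariate version.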
Pages: 541-550
Page count: 9