A systematic comparison of normalization methods for eQTL analysis

被引:7
|
作者
Yang, Jiajun [1 ]
Wang, Dongyang [1 ]
Yang, Yanbo [1 ]
Yang, Wenqian [1 ]
Jin, Weiwei [1 ]
Niu, Xiaohui [1 ]
Gong, Jing [1 ]
机构
[1] Huazhong Agr Univ, Coll Informat, Wuhan 430070, Peoples R China
基金
中国国家自然科学基金;
关键词
normalization; expression quantitative trait loci; eQTL; RNA-Seq data; gene expression; GENOME-WIDE ASSOCIATION; TRANS-EQTLS; IDENTIFICATION; DRIVERS; LOCI;
D O I
10.1093/bib/bbab193
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Expression quantitative trait loci (eQTL) analysis has been widely used in interpreting disease-associated loci through correlating genetic variant loci with the expression of specific genes. RNA-sequencing (RNA-Seq), which can quantify gene expression at the genome-wide level, is often used in eQTL identification. Since different normalization methods of gene expression have substantial impacts on RNA-seq downstream analysis, it is of great necessity to systematically compare the effects of these methods on eQTL identification. Here, by using RNA-seq and genotype data of four different cancers in The Cancer Genome Atlas (TCGA) database, we comprehensively evaluated the effect of eight commonly used normalization methods on eQTL identification. Our results showed that the application of different methods could cause 20-30% differences in the final results of eQTL identification. Among these methods, COUNT, Median of Ratio (MED) and Trimmed Mean of M-values (TMM) generated similar results for identifying eQTLs, while Fragments Per Kilobase Million (FPKM) or RANK produced more differential results compared with other methods. Based on the accuracy and receiver operating characteristic (ROC) curve, the TMM method was found to be the optimal method for normalizing gene expression data in eQTLs analysis. In addition, we also evaluated the performance of different pairwise combinations of these methods. As a result, compared with single normalization methods, the combination of methods can not only identify more cis-eQTLs, but also improve the performance of the ROC curve. Overall, this study provides a comprehensive comparison of normalization methods for identifying eQTLs from RNA-seq data, and proposes some practical recommendations for diverse scenarios.
引用
收藏
页数:13
相关论文
共 50 条
  • [11] Comparison of normalization methods for CodeLink Bioarray data
    Wu, W
    Dave, N
    Tseng, GC
    Richards, T
    Xing, EP
    Kaminski, N
    BMC BIOINFORMATICS, 2005, 6 (1)
  • [12] Comparison of normalization methods for CodeLink Bioarray data
    Wei Wu
    Nilesh Dave
    George C Tseng
    Thomas Richards
    Eric P Xing
    Naftali Kaminski
    BMC Bioinformatics, 6
  • [13] The comparison of different normalization methods in microarray data
    Tan Xiao-Jun
    Zhang Yong-Xin
    Qian Min-Ping
    Zhang You-Yi
    Deng Ming-Hua
    PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS, 2007, 34 (06) : 625 - 633
  • [14] Robust Statistical Methods For Genome-Wide Eqtl Analysis
    Rantalainen, Mattias
    Holmes, Chris
    GENETIC EPIDEMIOLOGY, 2012, 36 (02) : 140 - 140
  • [15] Methods for Population-Based eQTL Analysis in Human Genetics
    Lu Tian
    Andrew Quitadamo
    Frederick Lin
    Xinghua Shi
    Tsinghua Science and Technology, 2014, 19 (06) : 624 - 634
  • [16] Methods for Population-Based eQTL Analysis in Human Genetics
    Tian, Lu
    Quitadamo, Andrew
    Lin, Frederick
    Shi, Xinghua
    TSINGHUA SCIENCE AND TECHNOLOGY, 2014, 19 (06) : 624 - 634
  • [17] Comparison of Normalization Methods for Analysis of TempO-Seq Targeted RNA Sequencing Data
    Bushel, Pierre R.
    Ferguson, Stephen S.
    Ramaiahgari, Sreenivasa C.
    Paules, Richard S.
    Auerbach, Scott S.
    FRONTIERS IN GENETICS, 2020, 11
  • [18] Comparison of Protein Solubilization and Normalization Methods for Proteomics Analysis of Extracellular Vesicles from Urine
    Wang, Jun
    Li, Yang
    Wang, Yisheng
    Wang, Guoli
    Zhao, Chenyang
    Zhang, Ying
    Lu, Haojie
    JOURNAL OF PROTEOME RESEARCH, 2025,
  • [19] Comparison of normalization methods for Hi-C data
    Lyu, Hongqiang
    Liu, Erhu
    Wu, Zhifang
    BIOTECHNIQUES, 2020, 68 (02) : 56 - 64
  • [20] Comparison of normalization methods for RNA-Seq data
    Aghababazadeh, Farnoosh A.
    Li, Qian
    Fridley, Brooke L.
    GENETIC EPIDEMIOLOGY, 2018, 42 (07) : 684 - 684