A systematic comparison of normalization methods for eQTL analysis

被引:7
|
作者
Yang, Jiajun [1 ]
Wang, Dongyang [1 ]
Yang, Yanbo [1 ]
Yang, Wenqian [1 ]
Jin, Weiwei [1 ]
Niu, Xiaohui [1 ]
Gong, Jing [1 ]
机构
[1] Huazhong Agr Univ, Coll Informat, Wuhan 430070, Peoples R China
基金
中国国家自然科学基金;
关键词
normalization; expression quantitative trait loci; eQTL; RNA-Seq data; gene expression; GENOME-WIDE ASSOCIATION; TRANS-EQTLS; IDENTIFICATION; DRIVERS; LOCI;
D O I
10.1093/bib/bbab193
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Expression quantitative trait loci (eQTL) analysis has been widely used in interpreting disease-associated loci through correlating genetic variant loci with the expression of specific genes. RNA-sequencing (RNA-Seq), which can quantify gene expression at the genome-wide level, is often used in eQTL identification. Since different normalization methods of gene expression have substantial impacts on RNA-seq downstream analysis, it is of great necessity to systematically compare the effects of these methods on eQTL identification. Here, by using RNA-seq and genotype data of four different cancers in The Cancer Genome Atlas (TCGA) database, we comprehensively evaluated the effect of eight commonly used normalization methods on eQTL identification. Our results showed that the application of different methods could cause 20-30% differences in the final results of eQTL identification. Among these methods, COUNT, Median of Ratio (MED) and Trimmed Mean of M-values (TMM) generated similar results for identifying eQTLs, while Fragments Per Kilobase Million (FPKM) or RANK produced more differential results compared with other methods. Based on the accuracy and receiver operating characteristic (ROC) curve, the TMM method was found to be the optimal method for normalizing gene expression data in eQTLs analysis. In addition, we also evaluated the performance of different pairwise combinations of these methods. As a result, compared with single normalization methods, the combination of methods can not only identify more cis-eQTLs, but also improve the performance of the ROC curve. Overall, this study provides a comprehensive comparison of normalization methods for identifying eQTLs from RNA-seq data, and proposes some practical recommendations for diverse scenarios.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] A systematic evaluation of normalization methods in quantitative label-free proteomics
    Valikangas, Tommi
    Suomi, Tomi
    Elo, Laura L.
    BRIEFINGS IN BIOINFORMATICS, 2018, 19 (01) : 1 - 11
  • [32] Systematic Comparison of Fractionation Methods for In-depth Analysis of Plasma Proteomes
    Cao, Zhijun
    Tang, Hsin-Yao
    Wang, Huan
    Liu, Qin
    Speicher, David W.
    JOURNAL OF PROTEOME RESEARCH, 2012, 11 (06) : 3090 - 3100
  • [33] Comparison of methods for T1-w brain MRI intensity normalization for quantitative MRI analysis
    Wallimann, P.
    Mayinger, M.
    Bogowicz, M.
    Guckenberger, M.
    Andratschke, N.
    Tanadini-Lang, S.
    van Timmeren, J. E.
    RADIOTHERAPY AND ONCOLOGY, 2022, 170 : S127 - S128
  • [34] A systematic comparison of deep learning methods for EEG time series analysis
    Walther, Dominik
    Viehweg, Johannes
    Haueisen, Jens
    Maeder, Patrick
    FRONTIERS IN NEUROINFORMATICS, 2023, 17
  • [35] A systematic comparison of novel and existing differential analysis methods for CyTOF data
    Arend, Lis
    Bernett, Judith
    Manz, Quirin
    Klug, Melissa
    Lazareva, Olga
    Baumbach, Jan
    Bongiovanni, Dario
    List, Markus
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
  • [36] EXAMINATION OF NORMALIZATION METHODS FOR DATA ANALYSIS IN FREQUENCY DOMAIN
    Hales, M.
    Wang, Y. T.
    Johnson, B. F.
    MEDICINE AND SCIENCE IN SPORTS AND EXERCISE, 2001, 33 (05): : S84 - S84
  • [37] Normalization Methods for the Analysis of Unbalanced Transcriptome Data: A Review
    Liu, Xueyan
    Li, Nan
    Liu, Sheng
    Wang, Jun
    Zhang, Ning
    Zheng, Xubin
    Leung, Kwong-Sak
    Cheng, Lixin
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2019, 7
  • [38] Analysis of automated methods for spatial normalization of lesioned brains
    Ripolles, P.
    Marco-Pallares, J.
    de Diego-Balaguer, R.
    Miro, J.
    Falip, M.
    Juncadella, M.
    Rubio, F.
    Rodriguez-Fornells, A.
    NEUROIMAGE, 2012, 60 (02) : 1296 - 1306
  • [39] Normalization Methods for Ethanol Raman Spectra Quantitative Analysis
    Wu Zheng-jie
    Huang Yao-xiong
    Wang Cheng
    Li Shao-fa
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2010, 30 (04) : 971 - 974
  • [40] Comparison of Image Normalization Methods for Multi-Site Deep Learning
    Albert, Steffen
    Wichtmann, Barbara D.
    Zhao, Wenzhao
    Maurer, Angelika
    Hesser, Juergen
    Attenberger, Ulrike I.
    Schad, Lothar R.
    Zoellner, Frank G.
    APPLIED SCIENCES-BASEL, 2023, 13 (15):