Integration of datasets for individual prediction of DNA methylation-based biomarkers

被引:0
|
作者
Merzbacher, Charlotte [1 ]
Ryan, Barry [1 ]
Goldsborough, Thibaut [1 ]
Hillary, Robert F. [2 ]
Campbell, Archie [2 ]
Murphy, Lee [3 ]
Mcintosh, Andrew M. [2 ,4 ]
Liewald, David [5 ]
Harris, Sarah E. [5 ]
Mcrae, Allan F. [6 ]
Cox, Simon R. [5 ]
Cannings, Timothy I. [7 ]
Vallejos, Catalina A. [8 ,9 ]
Mccartney, Daniel L. [2 ]
Marioni, Riccardo E. [2 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh EH8 9AB, Scotland
[2] Univ Edinburgh, Inst Genet & Canc, Ctr Genom & Expt Med, Edinburgh EH4 2XU, Scotland
[3] Univ Edinburgh, Edinburgh Clin Res Facil, Edinburgh EH4 2XU, Scotland
[4] Univ Edinburgh, Ctr Clin Brain Sci, Div Psychiat, Edinburgh, Scotland
[5] Univ Edinburgh, Dept Psychol, Lothian Birth Cohorts, Edinburgh EH8 9JZ, Scotland
[6] Univ Queensland, Inst Mol Biosci, Brisbane, Australia
[7] Univ Edinburgh, Maxwell Inst Math Sci, Sch Math, Edinburgh EH9 3FD, Scotland
[8] Univ Edinburgh, Inst Genet & Canc, MRC Human Genet Unit, Edinburgh EH4 2XU, Scotland
[9] Alan Turing Inst, London, England
基金
英国惠康基金;
关键词
DNA methylation; Prediction; Biomarker; QUANTILE NORMALIZATION; PACKAGE; DESIGN;
D O I
10.1186/s13059-023-03114-5
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
BackgroundEpigenetic scores (EpiScores) can provide biomarkers of lifestyle and disease risk. Projecting new datasets onto a reference panel is challenging due to separation of technical and biological variation with array data. Normalisation can standardise data distributions but may also remove population-level biological variation.ResultsWe compare two birth cohorts (Lothian Birth Cohorts of 1921 and 1936 - nLBC1921 = 387 and nLBC1936 = 498) with blood-based DNA methylation assessed at the same chronological age (79 years) and processed in the same lab but in different years and experimental batches. We examine the effect of 16 normalisation methods on a novel BMI EpiScore (trained in an external cohort, n = 18,413), and Horvath's pan-tissue DNA methylation age, when the cohorts are normalised separately and together. The BMI EpiScore explains a maximum variance of R2=24.5% in BMI in LBC1936 (SWAN normalisation). Although there are cross-cohort R2 differences, the normalisation method makes a minimal difference to within-cohort estimates. Conversely, a range of absolute differences are seen for individual-level EpiScore estimates for BMI and age when cohorts are normalised separately versus together. While within-array methods result in identical EpiScores whether a cohort is normalised on its own or together with the second dataset, a range of differences is observed for between-array methods.ConclusionsNormalisation methods returning similar EpiScores, whether cohorts are analysed separately or together, will minimise technical variation when projecting new data onto a reference panel. These methods are important for cases where raw data is unavailable and joint normalisation of cohorts is computationally expensive.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Integration of datasets for individual prediction of DNA methylation-based biomarkers
    Charlotte Merzbacher
    Barry Ryan
    Thibaut Goldsborough
    Robert F. Hillary
    Archie Campbell
    Lee Murphy
    Andrew M. McIntosh
    David Liewald
    Sarah E. Harris
    Allan F. McRae
    Simon R. Cox
    Timothy I. Cannings
    Catalina A. Vallejos
    Daniel L. McCartney
    Riccardo E. Marioni
    Genome Biology, 24
  • [2] DNA Methylation-Based Biomarkers
    Cerchietti, Leandro
    Melnick, Ari
    JOURNAL OF CLINICAL ONCOLOGY, 2017, 35 (07) : 793 - 795
  • [3] DNA methylation-based biomarkers in urological cancers
    Xu, Rong
    Lin, Shaokun
    Xin, Jun
    Guo, Youhong
    Zhang, Wenzhou
    Lin, Yangjun
    Xu, Rui'an
    Information Technology Journal, 2013, 12 (17) : 4289 - 4297
  • [4] DNA methylation-based biomarkers in bladder cancer
    Raju Kandimalla
    Angela A. van Tilborg
    Ellen C. Zwarthoff
    Nature Reviews Urology, 2013, 10 : 327 - 335
  • [5] Developing DNA methylation-based diagnostic biomarkers
    Hyerim Kim
    Xudong Wang
    Peng Jin
    Journal of Genetics and Genomics, 2018, 45 (02) : 87 - 97
  • [6] DNA methylation-based biomarkers in bladder cancer
    Kandimalla, Raju
    van Tilborg, Angela A.
    Zwarthoff, Ellen C.
    NATURE REVIEWS UROLOGY, 2013, 10 (06) : 327 - 335
  • [7] Developing DNA methylation-based diagnostic biomarkers
    Kim, Hyerim
    Wang, Xudong
    Jin, Peng
    JOURNAL OF GENETICS AND GENOMICS, 2018, 45 (02) : 87 - 97
  • [8] DNA Methylation-based Diagnostic and Prognostic Biomarkers for Glioblastoma
    Tang, Yunliang
    Qing, Cheng
    Wang, Jiao
    Zeng, Zhenguo
    CELL TRANSPLANTATION, 2020, 29
  • [9] A clockwork fish: Age prediction using DNA methylation-based biomarkers in the European seabass
    Anastasiadi, Dafni
    Piferrer, Francesc
    MOLECULAR ECOLOGY RESOURCES, 2020, 20 (02) : 387 - 397
  • [10] DNA methylation-based biomarkers and the epigenetic clock theory of ageing
    Horvath, Steve
    Raj, Kenneth
    NATURE REVIEWS GENETICS, 2018, 19 (06) : 371 - 384