CONFINED: distinguishing biological from technical sources of variation by leveraging multiple methylation datasets

被引:4
|
作者
Thompson, Mike [1 ]
Chen, Zeyuan Johnson [1 ]
Rahmani, Elior [1 ]
Halperin, Eran [1 ,2 ,3 ,4 ]
机构
[1] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90024 USA
[2] Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Dept Anesthesiol & Perioperat Med, Los Angeles, CA 90095 USA
[4] Univ Calif Los Angeles, Dept Biomath, Los Angeles, CA 90095 USA
基金
以色列科学基金会;
关键词
EPIGENOME-WIDE ASSOCIATION; CELL-TYPE HETEROGENEITY; DNA METHYLATION; PROFILES; FIBROSIS; PACKAGE; DESIGN; GENES; RISK; NEED;
D O I
10.1186/s13059-019-1743-y
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Methylation datasets are affected by innumerable sources of variability, both biological (cell-type composition, genetics) and technical (batch effects). Here, we propose a reference-free method based on sparse canonical correlation analysis to separate the biological from technical sources of variability. We show through simulations and real data that our method, CONFINED, is not only more accurate than the state-of-the-art reference-free methods for capturing known, replicable biological variability, but it is also considerably more robust to dataset-specific technical variability than previous approaches. CONFINED is available as an R package as detailed at https://github.com/cozygene/CONFINED.
引用
收藏
页数:15
相关论文
共 37 条
  • [21] Modelling technical and biological biases in macroinvertebrate community assessment from bulk preservative using multiple metabarcoding markers
    Martins, Filipa M. S.
    Porto, Miguel
    Feio, Maria J.
    Egeter, Bastian
    Bonin, Aurelie
    Serra, Sonia R. Q.
    Taberlet, Pierre
    Beja, Pedro
    MOLECULAR ECOLOGY, 2021, 30 (13) : 3221 - 3238
  • [22] Predicting pharmacotherapeutic outcomes for type 2 diabetes: An evaluation of three approaches to leveraging electronic health record data from multiple sources
    Tarumi, Shinji
    Takeuchi, Wataru
    Qi, Rong
    Ning, Xia
    Ruppert, Laura
    Ban, Hideyuki
    Robertson, Daniel H.
    Schleyer, Titus
    Kawamoto, Kensaku
    JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 129
  • [23] Technical Note: A new global database of trace gases and aerosols from multiple sources of high vertical resolution measurements
    Hassler, B.
    Bodeker, G. E.
    Dameris, M.
    ATMOSPHERIC CHEMISTRY AND PHYSICS, 2008, 8 (17) : 5403 - 5421
  • [24] Sources of variation in baseline gene expression levels from toxicogenomics study control animals across multiple laboratories
    Michael J Boedigheimer
    Russell D Wolfinger
    Michael B Bass
    Pierre R Bushel
    Jeff W Chou
    Matthew Cooper
    J Christopher Corton
    Jennifer Fostel
    Susan Hester
    Janice S Lee
    Fenglong Liu
    Jie Liu
    Hui-Rong Qian
    John Quackenbush
    Syril Pettit
    Karol L Thompson
    BMC Genomics, 9
  • [25] Sources of variation in baseline gene expression levels from toxicogenomics study control animals across multiple laboratories
    Boedigheimer, Michael J.
    Wolfinger, Russell D.
    Bass, Michael B.
    Bushel, Pierre R.
    Chou, Jeff W.
    Cooper, Matthew
    Corton, J. Christopher
    Fostel, Jennifer
    Hester, Susan
    Lee, Janice S.
    Liu, Fenglong
    Liu, Jie
    Qian, Hui-Rong
    Quackenbush, John
    Pettit, Syril
    Thompson, Karol L.
    BMC GENOMICS, 2008, 9 (1)
  • [26] BIOLOGICAL VARIABILITY IN CONCENTRATIONS OF SERUM-LIPIDS - SOURCES OF VARIATION AMONG RESULTS FROM PUBLISHED STUDIES AND COMPOSITE PREDICTED VALUES
    SMITH, SJ
    COOPER, GR
    MYERS, GL
    SAMPSON, EJ
    CLINICAL CHEMISTRY, 1993, 39 (06) : 1012 - 1022
  • [27] STUDIES ON BIOLOGICAL METHYLATION .14. THE FORMATION OF TRIMETHYLARSINE AND DIMETHYL SELENIDE IN MOULD CULTURES FROM METHYL SOURCES CONTAINING C-14
    CHALLENGER, F
    LISLE, DB
    DRANSFIELD, PB
    JOURNAL OF THE CHEMICAL SOCIETY, 1954, (JUN): : 1760 - 1771
  • [28] DNA methylation analysis of multiple tissues from newborn twins reveals both genetic and intrauterine components to variation in the human neonatal epigenome
    Ollikainen, Miina
    Smith, Katherine R.
    Joo, Eric Ji-Hoon
    Ng, Hong Kiat
    Andronikos, Roberta
    Novakovic, Boris
    Aziz, Nur Khairunnisa Abdul
    Carlin, John B.
    Morley, Ruth
    Saffery, Richard
    Craig, Jeffrey M.
    HUMAN MOLECULAR GENETICS, 2010, 19 (21) : 4176 - 4188
  • [29] Kappa/Lambda Ratio for Early Detection of Multiple Myeloma Relapse Using the Reference Change Value from Biological Variation Studies
    Wu, Alan H. B.
    JOURNAL OF APPLIED LABORATORY MEDICINE, 2021, 6 (06): : 1683 - 1687
  • [30] Multiple regression analysis of the sources of variation in orientation of two sympatric sandhoppers, Talitrus saltator and Talorchestia brito, from an exposed Mediterranean beach
    Felicita Scapini
    Andrea Aloia
    Mohamed F. Bouslama
    Lorenzo Chelazzi
    Isabella Colombini
    Mohamed ElGtari
    Mario Fallaci
    Giovanni M.Marchetti
    Behavioral Ecology and Sociobiology, 2002, 51 : 403 - 414