CONFINED: distinguishing biological from technical sources of variation by leveraging multiple methylation datasets

被引:4
|
作者
Thompson, Mike [1 ]
Chen, Zeyuan Johnson [1 ]
Rahmani, Elior [1 ]
Halperin, Eran [1 ,2 ,3 ,4 ]
机构
[1] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90024 USA
[2] Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Dept Anesthesiol & Perioperat Med, Los Angeles, CA 90095 USA
[4] Univ Calif Los Angeles, Dept Biomath, Los Angeles, CA 90095 USA
基金
以色列科学基金会;
关键词
EPIGENOME-WIDE ASSOCIATION; CELL-TYPE HETEROGENEITY; DNA METHYLATION; PROFILES; FIBROSIS; PACKAGE; DESIGN; GENES; RISK; NEED;
D O I
10.1186/s13059-019-1743-y
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Methylation datasets are affected by innumerable sources of variability, both biological (cell-type composition, genetics) and technical (batch effects). Here, we propose a reference-free method based on sparse canonical correlation analysis to separate the biological from technical sources of variability. We show through simulations and real data that our method, CONFINED, is not only more accurate than the state-of-the-art reference-free methods for capturing known, replicable biological variability, but it is also considerably more robust to dataset-specific technical variability than previous approaches. CONFINED is available as an R package as detailed at https://github.com/cozygene/CONFINED.
引用
收藏
页数:15
相关论文
共 37 条
  • [31] Multiple regression analysis of the sources of variation in orientation of two sympatric sandhoppers, Talitrus saltator and Talorchestia brito, from an exposed Mediterranean beach
    Scapini, F
    Aloia, A
    Bouslama, MF
    Chelazzi, L
    Colombini, I
    ElGtari, M
    Fallaci, M
    Marchetti, GM
    BEHAVIORAL ECOLOGY AND SOCIOBIOLOGY, 2002, 51 (05) : 403 - 414
  • [32] LEVERAGING LABORATORY RESULTS FROM MULTIPLE DATA SOURCES TO RE-ASSESS A POSSIBLE ASSOCIATION BETWEEN SERUM URIC ACID AND SCHIZOPHRENIA IN REAL WORLD PRACTICE SETTINGS
    Aguilar, D.
    Chick, J.
    James, M. P.
    Wade, R. L.
    Winniewicz, P. W.
    Lu, J.
    Waheed, R.
    VALUE IN HEALTH, 2022, 25 (07) : S588 - S588
  • [33] Rapid preparation of 1-vinylimidazole based non-affinity polymers for the highly-selective purification of antibodies from multiple biological sources
    Lei, Yutian
    Liu, Xiao
    Lu, Li
    Liu, Cuihua
    Xu, Rongrong
    Huang, Shengfeng
    Shen, Yuan
    Deng, Chunping
    Yu, Jinchen
    Zhang, Tingting
    Crommen, Jacques
    Wang, Qiqin
    Jiang, Zhengjin
    JOURNAL OF CHROMATOGRAPHY A, 2020, 1632 (1632)
  • [34] In silico characterization of multiple genes encoding the GP63 virulence protein from Leishmania braziliensis: identification of sources of variation and putative roles in immune evasion
    Castro Neto, Artur L.
    Brito, Adriana N. A. L. M.
    Rezende, Antonio M.
    Magalhaes, Franklin B.
    de Melo Neto, Osvaldo P.
    BMC GENOMICS, 2019, 20 (1)
  • [35] In silico characterization of multiple genes encoding the GP63 virulence protein from Leishmania braziliensis: identification of sources of variation and putative roles in immune evasion
    Artur L. Castro Neto
    Adriana N. A. L. M. Brito
    Antonio M. Rezende
    Franklin B. Magalhães
    Osvaldo P. de Melo Neto
    BMC Genomics, 20
  • [36] QUANTITATIVE PURIFICATION AND DETERMINATION OF COMPOUNDS FROM BIOLOGICAL SOURCES BY MEANS OF MULTIPLE ELIMINATION THIN-LAYER CHROMATOGRAPHY (METC) USING A COMBINATION OF SOLVENTS OR (AND) A COMBINATION OF ADSORBENTS
    VANSUMERE, CF
    KINT, J
    COTTENIE, J
    ARCHIVES INTERNATIONALES DE PHYSIOLOGIE ET DE BIOCHIMIE, 1968, 76 (02): : 396 - +