Sequence-Based Machine Learning Reveals 3D Genome Differences between Bonobos and Chimpanzees

被引:1
|
作者
Brand, Colin M. [1 ,2 ]
Kuang, Shuzhen [3 ]
Gilbertson, Erin N. [1 ,4 ]
McArthur, Evonne [5 ,6 ]
Pollard, Katherine S. [1 ,2 ,3 ,7 ]
Webster, Timothy H. [8 ]
Capra, John A. [1 ,2 ,4 ]
机构
[1] Univ Calif San Francisco, Bakar Computat Hlth Sci Inst, San Francisco, CA 94143 USA
[2] Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco 94143, CA USA
[3] Gladstone Inst Data Sci & Biotechnol, San Francisco, CA USA
[4] Univ Calif San Francisco, Biomed Informat Grad Program, San Francisco 94143, CA USA
[5] Vanderbilt Univ, Vanderbilt Genet Inst, Nashville, TN USA
[6] Univ Washington, Dept Med, Seattle, WA USA
[7] Chan Zuckerberg Biohub, San Francisco, CA USA
[8] Univ Utah, Dept Anthropol, Salt Lake City, UT USA
来源
GENOME BIOLOGY AND EVOLUTION | 2024年 / 16卷 / 11期
基金
美国国家卫生研究院;
关键词
bonobo; chimpanzee; gene regulation; 3D genome folding; machine learning; CHROMATIN DOMAINS; ORGANIZATION; DIVERSITY; EVOLUTION; DATABASE; TOOL;
D O I
10.1093/gbe/evae210
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The 3D structure of the genome is an important mediator of gene expression. As phenotypic divergence is largely driven by gene regulatory variation, comparing genome 3D contacts across species can further understanding of the molecular basis of species differences. However, while experimental data on genome 3D contacts in humans are increasingly abundant, only a handful of 3D genome contact maps exist for other species. Here, we demonstrate that human experimental data can be used to close this data gap. We apply a machine learning model that predicts 3D genome contacts from DNA sequence to the genomes from 56 bonobos and chimpanzees and identify species-specific patterns of genome folding. We estimated 3D divergence between individuals from the resulting contact maps in 4,420 1 Mb genomic windows, of which similar to 17% were substantially divergent in predicted genome contacts. Bonobos and chimpanzees diverged at 89 windows, overlapping genes associated with multiple traits implicated in Pan phenotypic divergence. We discovered 51 bonobo-specific variants that individually produce the observed bonobo contact pattern in bonobo-chimpanzee divergent windows. Our results demonstrate that machine learning methods can leverage human data to fill in data gaps across species, offering the first look at population-level 3D genome variation in nonhuman primates. We also identify loci where changes in 3D folding may contribute to phenotypic differences in our closest living relatives.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Machine Learning Methods for Exploring Sequence Determinants of 3D Genome Organization
    Yang, Muyu
    Ma, Jian
    JOURNAL OF MOLECULAR BIOLOGY, 2022, 434 (15)
  • [2] Sequence-based identification of 3D structural modules in RNA with RMDetect
    Cruz J.A.
    Westhof E.
    Nature Methods, 2011, 8 (6) : 513 - 519
  • [3] Sequence-based identification of 3D structural modules in RNA with RMDetect
    Cruz, Jose Almeida
    Westhof, Eric
    NATURE METHODS, 2011, 8 (06) : 513 - U115
  • [4] Generalized Pose Decoupled Network for Unsupervised 3D Skeleton Sequence-Based Action Representation Learning
    Liu, Mengyuan
    Meng, Fanyang
    Liang, Yongsheng
    CYBORG AND BIONIC SYSTEMS, 2022, 2022
  • [5] Individual and population-level variation in 3D genome folding among bonobos (Pan paniscus) and chimpanzees (Pan troglodytes)
    Brand, Colin M.
    Kuang, Shuzhen
    Gilbertson, Erin N.
    Mcarthur, Evonne
    Pollard, Katherine S.
    Webster, Timothy H.
    Capra, John A.
    AMERICAN JOURNAL OF BIOLOGICAL ANTHROPOLOGY, 2023, 180 : 20 - 21
  • [6] Sequence-Based Prediction of Cysteine Reactivity Using Machine Learning
    Wang, Haobo
    Chen, Xuemin
    Li, Can
    Liu, Yuan
    Yang, Fan
    Wang, Chu
    BIOCHEMISTRY, 2018, 57 (04) : 451 - 460
  • [7] Sequence-based analysis and prediction of lantibiotics: A machine learning approach
    Poorinmohammad, Naghmeh
    Hamedi, Javad
    Moghaddam, Mohammad Hossein Abbaspour Motlagh
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2018, 77 : 199 - 206
  • [8] Sequence-Based Predicting Bacterial Essential ncRNAs Algorithm by Machine Learning
    Ye, Yuan-Nong
    Liang, Ding-Fa
    Labena, Abraham Alemayehu
    Zeng, Zhu
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (03): : 2731 - 2741
  • [9] Improving sequence-based fold recognition by using 3D model quality assessment
    Pettitt, CS
    McGuffin, LJ
    Jones, DT
    BIOINFORMATICS, 2005, 21 (17) : 3509 - 3515
  • [10] Machine Learning Reveals the Diversity of Human 3D Chromatin Contact Patterns
    Gilbertson, Erin N.
    Brand, Colin M.
    Mcarthur, Evonne
    Rinker, David C.
    Kuang, Shuzhen
    Pollard, Katherine S.
    Capra, John A.
    MOLECULAR BIOLOGY AND EVOLUTION, 2024, 41 (10)