Sequence-Based Machine Learning Reveals 3D Genome Differences between Bonobos and Chimpanzees

被引:1
|
作者
Brand, Colin M. [1 ,2 ]
Kuang, Shuzhen [3 ]
Gilbertson, Erin N. [1 ,4 ]
McArthur, Evonne [5 ,6 ]
Pollard, Katherine S. [1 ,2 ,3 ,7 ]
Webster, Timothy H. [8 ]
Capra, John A. [1 ,2 ,4 ]
机构
[1] Univ Calif San Francisco, Bakar Computat Hlth Sci Inst, San Francisco, CA 94143 USA
[2] Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco 94143, CA USA
[3] Gladstone Inst Data Sci & Biotechnol, San Francisco, CA USA
[4] Univ Calif San Francisco, Biomed Informat Grad Program, San Francisco 94143, CA USA
[5] Vanderbilt Univ, Vanderbilt Genet Inst, Nashville, TN USA
[6] Univ Washington, Dept Med, Seattle, WA USA
[7] Chan Zuckerberg Biohub, San Francisco, CA USA
[8] Univ Utah, Dept Anthropol, Salt Lake City, UT USA
来源
GENOME BIOLOGY AND EVOLUTION | 2024年 / 16卷 / 11期
基金
美国国家卫生研究院;
关键词
bonobo; chimpanzee; gene regulation; 3D genome folding; machine learning; CHROMATIN DOMAINS; ORGANIZATION; DIVERSITY; EVOLUTION; DATABASE; TOOL;
D O I
10.1093/gbe/evae210
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The 3D structure of the genome is an important mediator of gene expression. As phenotypic divergence is largely driven by gene regulatory variation, comparing genome 3D contacts across species can further understanding of the molecular basis of species differences. However, while experimental data on genome 3D contacts in humans are increasingly abundant, only a handful of 3D genome contact maps exist for other species. Here, we demonstrate that human experimental data can be used to close this data gap. We apply a machine learning model that predicts 3D genome contacts from DNA sequence to the genomes from 56 bonobos and chimpanzees and identify species-specific patterns of genome folding. We estimated 3D divergence between individuals from the resulting contact maps in 4,420 1 Mb genomic windows, of which similar to 17% were substantially divergent in predicted genome contacts. Bonobos and chimpanzees diverged at 89 windows, overlapping genes associated with multiple traits implicated in Pan phenotypic divergence. We discovered 51 bonobo-specific variants that individually produce the observed bonobo contact pattern in bonobo-chimpanzee divergent windows. Our results demonstrate that machine learning methods can leverage human data to fill in data gaps across species, offering the first look at population-level 3D genome variation in nonhuman primates. We also identify loci where changes in 3D folding may contribute to phenotypic differences in our closest living relatives.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Sequence-based genetic mapping of Cynodon dactylon Pers. reveals new insights into genome evolution in Poaceae
    Fang, Tilin
    Dong, Hongxu
    Yu, Shuhao
    Moss, Justin Q.
    Fontanier, Charles H.
    Martin, Dennis L.
    Fu, Jinmin
    Wu, Yanqi
    COMMUNICATIONS BIOLOGY, 2020, 3 (01)
  • [32] VacPred: Sequence-based prediction of plant vacuole proteins using machine-learning techniques
    Yadav, Arvind Kumar
    Singla, Deepak
    JOURNAL OF BIOSCIENCES, 2020, 45 (01)
  • [33] A machine-learning approach for predicting palmitoylation sites from integrated sequence-based features
    Li, Liqi
    Luo, Qifa
    Xiao, Weidong
    Li, Jinhui
    Zhou, Shiwen
    Li, Yongsheng
    Zheng, Xiaoqi
    Yang, Hua
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2017, 15 (01)
  • [34] VacPred: Sequence-based prediction of plant vacuole proteins using machine-learning techniques
    Arvind Kumar Yadav
    Deepak Singla
    Journal of Biosciences, 2020, 45
  • [35] Attention-Based Pose Sequence Machine for 3D Hand Pose Estimation
    Guo, Fangtai
    He, Zaixing
    Zhang, Shuyou
    Zhao, Xinyue
    Tan, Jianrong
    IEEE ACCESS, 2020, 8 : 18258 - 18269
  • [36] Sequence-based genetic mapping of Cynodon dactylon Pers. reveals new insights into genome evolution in Poaceae
    Tilin Fang
    Hongxu Dong
    Shuhao Yu
    Justin Q. Moss
    Charles H. Fontanier
    Dennis L. Martin
    Jinmin Fu
    Yanqi Wu
    Communications Biology, 3
  • [37] Sequence-based machine learning method for predicting the effects of phosphorylation on protein-protein interactions
    Hong, Xiaokun
    Lv, Jiyang
    Li, Zhengxin
    Xiong, Yi
    Zhang, Jian
    Chen, Hai-Feng
    INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2023, 243
  • [38] Noise Filtering Method of a 3D Target Image Based on Machine Learning
    Ren, Yanfei
    Engineering Intelligent Systems, 2021, 29 (04): : 195 - 204
  • [39] Towards Virtual 3D Asset Price Prediction Based on Machine Learning
    Korbel, Jakob J.
    Siddiq, Umar H.
    Zarnekow, Ruediger
    JOURNAL OF THEORETICAL AND APPLIED ELECTRONIC COMMERCE RESEARCH, 2022, 17 (03): : 924 - 948
  • [40] 3D Reconstruction Method of Virtual and Real Fusion Based on Machine Learning
    Zhu, Wenyao
    Zhou, Shuyue
    Mathematical Problems in Engineering, 2022, 2022