The Unreasonable Effectiveness of Convolutional Neural Networks in Population Genetic Inference

被引:104
|
作者
Flagel, Lex [1 ,2 ]
Brandvain, Yaniv [2 ]
Schrider, Daniel R. [3 ]
机构
[1] Monsanto Co, Chesterfield, MO USA
[2] Univ Minnesota, Dept Plant & Microbial Biol, St Paul, MN 55108 USA
[3] Univ N Carolina, Dept Genet, Chapel Hill, NC 27515 USA
基金
美国国家卫生研究院;
关键词
population genetics; selective sweeps; demographic inference; recombination; machine learning; introgression; SELECTIVE SWEEPS; POSITIVE SELECTION; LINKAGE DISEQUILIBRIUM; RECOMBINATION RATES; STATISTICAL TESTS; GENOMIC REGIONS; SOFT SWEEPS; HISTORY; MODEL; HITCHHIKING;
D O I
10.1093/molbev/msy224
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Population-scale genomic data sets have given researchers incredible amounts of information from which to infer evolutionary histories. Concomitant with this flood of data, theoretical and methodological advances have sought to extract information from genomic sequences to infer demographic events such as population size changes and gene flow among closely related populations/species, construct recombination maps, and uncover loci underlying recent adaptation. To date, most methods make use of only one or a few summaries of the input sequences and therefore ignore potentially useful information encoded in the data. The most sophisticated of these approaches involve likelihood calculations, which require theoretical advances for each new problem, and often focus on a single aspect of the data (e.g., only allele frequency information) in the interest of mathematical and computational tractability. Directly interrogating the entirety of the input sequence data in a likelihood-free manner would thus offer a fruitful alternative. Here, we accomplish this by representing DNA sequence alignments as images and using a class of deep learning methods called convolutional neural networks (CNNs) to make population genetic inferences from these images. We apply CNNs to a number of evolutionary questions and find that they frequently match or exceed the accuracy of current methods. Importantly, we show that CNNs perform accurate evolutionary model selection and parameter estimation, even on problems that have not received detailed theoretical treatments. Thus, when applied to population genetic alignments, CNNs are capable of outperforming expert-derived statistical methods and offer a new path forward in cases where no likelihood approach exists.
引用
收藏
页码:220 / 238
页数:19
相关论文
共 50 条
  • [1] Dispersal inference from population genetic variation using a convolutional neural network
    Smith, Chris C. R.
    Tittes, Silas
    Ralph, Peter L.
    Kern, Andrew D.
    [J]. GENETICS, 2023, 224 (02)
  • [2] Simulating quantized inference on convolutional neural networks
    Finotti, Vitor
    Albertini, Bruno
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2021, 95
  • [3] Simulating quantized inference on convolutional neural networks
    Finotti, Vitor
    Albertini, Bruno
    [J]. Computers and Electrical Engineering, 2021, 95
  • [4] xDNN: Inference for Deep Convolutional Neural Networks
    D'Alberto, Paolo
    Wu, Victor
    Ng, Aaron
    Nimaiyar, Rahul
    Delaye, Elliott
    Sirasao, Ashish
    [J]. ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2022, 15 (02)
  • [5] DSIP: A Scalable Inference Accelerator for Convolutional Neural Networks
    Jo, Jihyuck
    Cha, Soyoung
    Rho, Dayoung
    Park, In-Cheol
    [J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2018, 53 (02) : 605 - 618
  • [6] Binarized Convolutional Neural Networks for Efficient Inference on GPUs
    Khan, Mir
    Huttunen, Heikki
    Boutellier, Jani
    [J]. 2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 682 - 686
  • [7] Convolutional Neural Networks for Valid and Efficient Causal Inference
    Ghasempour, Mohammad
    Moosavi, Niloofar
    de Luna, Xavier
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2024, 33 (02) : 714 - 723
  • [8] The unreasonable effectiveness of neural network approximation
    Dingankar, AT
    [J]. SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION, 1997, : 1345 - 1349
  • [9] The unreasonable effectiveness of neural network approximation
    Dingankar, AT
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1999, 44 (11) : 2043 - 2044
  • [10] Convolutional Neural Networks Inference Accelerator Design using Selective Convolutional Layer
    Huang, Tzu-Huan
    Goh, Emil
    Wey, I-Chyn
    Teo, T. Hui
    [J]. 2023 IEEE 16TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP, MCSOC, 2023, : 166 - 170