ModelRevelator: Fast phylogenetic model estimation via deep learning

被引：9

作者：

Burgstaller-Muehlbacher, Sebastian ^{[1
,2
]}

Crotty, Stephen M. ^{[3
,4
]}

Schmidt, Heiko A. ^{[1
,2
]}

Reden, Franziska ^{[1
,2
]}

Drucks, Tamara ^{[1
,2
,6
]}

von Haeseler, Arndt ^{[1
,2
,5
]}

机构：

[1] Univ Vienna, Max Perutz Labs, Ctr Integrat Bioinformat Vienna, A-1030 Vienna, Austria

[2] Med Univ Vienna, Vienna Bioctr VBC 5, A-1030 Vienna, Austria

[3] Univ Adelaide, Sch Math Sci, Adelaide, SA 5005, Australia

[4] Univ Adelaide, ARC Ctr Excellence Math & Stat Frontiers, Adelaide, SA 5005, Australia

[5] Univ Vienna, Fac Comp Sci, Bioinformat & Computat Biol, Waehringer Str 29, A-1090 Vienna, Austria

[6] TU Wien, Res Unit Machine Learning, A-1040 Vienna, Austria

来源：

MOLECULAR PHYLOGENETICS AND EVOLUTION | 2023年 / 188卷

关键词：

Phylogenetic model estimation; Deep learning; Artificial intelligence; Phylogenetics; Phylogenomics; DNA-SEQUENCES; SELECTION; SUBSTITUTIONS; SIMULATION; JMODELTEST; EVOLUTION; PROTEIN; SITES; RATES; TREE;

D O I：

10.1016/j.ympev.2023.107905

中图分类号：

Q5 [生物化学]; Q7 [分子生物学];

学科分类号：

071010 ; 081704 ;

摘要：

Selecting the best model of sequence evolution for a multiple-sequence-alignment (MSA) constitutes the first step of phylogenetic tree reconstruction. Common approaches for inferring nucleotide models typically apply maximum likelihood (ML) methods, with discrimination between models determined by one of several information criteria. This requires tree reconstruction and optimisation which can be computationally expensive. We demonstrate that neural networks can be used to perform model selection, without the need to reconstruct trees, optimise parameters, or calculate likelihoods.We introduce ModelRevelator, a model selection tool underpinned by two deep neural networks. The first neural network, NNmodelfind, recommends one of six commonly used models of sequence evolution, ranging in complexity from Jukes and Cantor to General Time Reversible. The second, NNalphafind, recommends whether or not a Gamma-distributed rate heterogeneous model should be incorporated, and if so, provides an estimate of the shape parameter, alpha. Users can simply input an MSA into ModelRevelator, and swiftly receive output recommending the evolutionary model, inclusive of the presence or absence of rate heterogeneity, and an estimate of alpha.We show that ModelRevelator performs comparably with likelihood-based methods and the recently published machine learning method ModelTeller over a wide range of parameter settings, with significant potential savings in computational effort. Further, we show that this performance is not restricted to the alignments on which the networks were trained, but is maintained even on unseen empirical data. We expect that ModelRevelator will provide a valuable alternative for phylogeneticists, especially where traditional methods of model selection are computationally prohibitive.

引用

页数：16

共 50 条

[41] Estimation of personal driving style via deep inverse reinforcement learning
Daiko Kishikawa
Sachiyo Arai
Artificial Life and Robotics, 2021, 26 : 338 - 346
[42] Motion Estimation via Scale-Space in Unsupervised Deep Learning
Kim, Jaehwan
Derbel, Bilel
Hong, Byung-Woo
35TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2021), 2021, : 730 - 731
[43] Visibility estimation via deep label distribution learning in cloud environment
Song, Mofei
Han, Xu
Liu, Xiao Fan
Li, Qian
JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2021, 10 (01):
[44] Off-grid DOA estimation via a deep learning framework
Yan HUANG
Yanjun ZHANG
Jun TAO
Cai WEN
Guisheng LIAO
Wei HONG
Science China(Information Sciences), 2023, 66 (12) : 222 - 237
[45] Deep Learning for Opportunistic Rain Estimation via Satellite Microwave Links
Scognamiglio, Giovanni
Rucci, Andrea
Vaccaro, Attilio
Adirosi, Elisa
Sapienza, Fabiola
Giannetti, Filippo
Bacci, Giacomo
Angeloni, Sabina
Baldini, Luca
Roversi, Giacomo
Ortolani, Alberto
Antonini, Andrea
Melani, Samantha
SENSORS, 2024, 24 (21)
[46] Estimation of personal driving style via deep inverse reinforcement learning
Kishikawa, Daiko
Arai, Sachiyo
ARTIFICIAL LIFE AND ROBOTICS, 2021, 26 (03) : 338 - 346
[47] Visibility estimation via deep label distribution learning in cloud environment
Mofei Song
Xu Han
Xiao Fan Liu
Qian Li
Journal of Cloud Computing, 10
[48] Two-Dimensional DOA Estimation via Deep Ensemble Learning
Zhu, Wenli
Zhang, Min
Li, Pengfei
Wu, Chenxi
IEEE ACCESS, 2020, 8 : 124544 - 124552
[49] Depression Intensity Estimation via Social Media: A Deep Learning Approach
Ghosh, Shreya
Anwar, Tarique
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2021, 8 (06) : 1465 - 1474
[50] Deep Learning Based Wiretap Coding via Mutual Information Estimation
Fritschek, Rick
Schaefer, Rafael F.
Wunder, Gerhard
PROCEEDINGS OF THE 2ND ACM WORKSHOP ON WIRELESS SECURITY AND MACHINE LEARNING, WISEML 2020, 2020, : 74 - 79

← 1 2 3 4 5 →