SPECTRAL-ANALYSIS OF PHYLOGENETIC DATA

Cited by: 140
Author
HENDY, MD [1]
Affiliation
[1] MASSEY UNIV,DEPT BOT & ZOOL,PALMERSTON NORTH,NEW ZEALAND
Keywords
PHYLOGENETIC TREES; BIPARTITION; HADAMARD TRANSFORM; HADAMARD CONJUGATION; SPECTRUM; NUCLEOTIDE SEQUENCES; DISTANCE DATA; FAST HADAMARD TRANSFORM
DOI
10.1007/BF02638451
Chinese Library Classification
O1 [Mathematics]
Subject classification code
0701; 070101
Abstract
The spectral analysis of sequence and distance data is a new approach to phylogenetic analysis. For two-state character sequences, the character values at a given site split the set of taxa into two subsets, a bipartition of the taxa set. The vector which counts the relative numbers of each of these bipartitions over all sites is called a sequence spectrum. Applying a transformation called a Hadamard conjugation, the sequence spectrum is transformed to the conjugate spectrum. This conjugation corrects for unobserved changes in the data, independently from the choice of phylogenetic tree. For any given phylogenetic tree with edge weights (probabilities of state change), we define a corresponding tree spectrum. The selection of a weighted phylogenetic tree from the given sequence data is made by matching the conjugate spectrum with a tree spectrum. We develop an optimality selection procedure using a least squares best fit, to find the phylogenetic tree whose tree spectrum most closely matches the conjugate spectrum. An inferred sequence spectrum can be derived from the selected tree spectrum using the inverse Hadamard conjugation to allow a comparison with the original sequence spectrum.
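The pipeline described in the abstract can be illustrated with a minimal sketch. The abstract does not state the formula, so this assumes the form usually given for two-state Hadamard conjugation, γ = H⁻¹ ln(H s), with inverse s = H⁻¹ exp(H γ), where H is the Sylvester Hadamard matrix of order 2^(n-1) for n taxa. The function names, the choice of the last taxon as reference, and the bitmask indexing of bipartitions are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def sequence_spectrum(alignment):
    """Relative frequency of each bipartition over all sites.

    alignment: list of equal-length two-state strings, one per taxon.
    Each site is indexed by the bitmask of taxa whose state differs
    from the last (reference) taxon; index 0 is a constant site.
    """
    n_taxa = len(alignment)
    n_sites = len(alignment[0])
    spectrum = np.zeros(2 ** (n_taxa - 1))
    reference = alignment[-1]
    for site in range(n_sites):
        index = 0
        for taxon in range(n_taxa - 1):
            if alignment[taxon][site] != reference[site]:
                index |= 1 << taxon
        spectrum[index] += 1
    return spectrum / n_sites

def fwht(values):
    """Unnormalised fast Hadamard transform (Sylvester ordering)."""
    v = np.array(values, dtype=float)
    h = 1
    while h < len(v):
        for start in range(0, len(v), 2 * h):
            a = v[start:start + h].copy()
            b = v[start + h:start + 2 * h].copy()
            v[start:start + h] = a + b
            v[start + h:start + 2 * h] = a - b
        h *= 2
    return v

def hadamard_conjugate(spectrum):
    """Assumed form: gamma = H^-1 ln(H s)."""
    r = fwht(spectrum)
    if np.any(r <= 0):
        raise ValueError("logarithm undefined: data too divergent")
    return fwht(np.log(r)) / len(spectrum)

def inverse_hadamard_conjugate(gamma):
    """Assumed form: s = H^-1 exp(H gamma)."""
    return fwht(np.exp(fwht(gamma))) / len(gamma)
```

Applied to a toy three-taxon alignment such as `["0100000010", "0010000010", "0000000000"]`, `sequence_spectrum` gives four bipartition frequencies, and `inverse_hadamard_conjugate(hadamard_conjugate(s))` should recover them up to rounding. Selecting a tree by least squares fit of a tree spectrum to the conjugate spectrum is not covered by this sketch.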
Pages: 5-24
Page count: 20