Phylogenetic Trees via Hamming Distance Decomposition Tests

Número: 
34
Ano: 
2004
Autor: 
Cézar A. F. Anselmo
Aluísio Pinheiro
Abstract: 

The paper considers the problem of Phylogenetic tree construction. Our approach to the problem bases itself on a non-parametric paradigma seeking a model free construction and symmetry on Type I and II errors. Trees are constructed through sequential tests using Hamming distance dissimilarity measures, from internal nodes to the tips. The method has some advantages over the traditional methods. It is very fast, computationally efficient, and feasible to be used for very large datasets. Two other novelties are its capacity to deal directly with multiple sequences per group (and built its statistical properties upon this richer information) and that the best tree will not have a predetermined number of tips i.e. the resulting number of tips will be statistically meaningful.We applied the method in a sample of primate mitochondrial DNA sequences, illustrating that it can perform quite well even on very unbalanced design. Computational complexities are also addressed.

Keywords: 
Phylogenetic Tree
Hamming Distance
Dissimilarity Measures
Statistical Genetics
Non-parametric Test
Arquivo: