Relatório de pesquisa 34/04

Cézar A. F. Anselmo and Aluísio Pinheiro, Phylogenetic Trees via Hamming Distance Decomposition Tests

Abstract:
The paper considers the problem of Phylogenetic tree construction. Our approach to the problem bases itself on a non-parametric paradigma seeking a model free construction and symmetry on Type I and II errors. Trees are constructed through sequential tests using Hamming distance dissimilarity measures, from internal nodes to the tips. The method has some advantages over the traditional methods. It is very fast, computationally efficient, and feasible to be used for very large datasets.
 Two other novelties are its capacity to deal directly with multiple sequences per group (and built  its statistical properties upon this richer information) and that the best tree will not have   a predetermined number of tips i.e. the resulting number of tips will be statistically meaningful.We applied the method in a sample of primate mitochondrial DNA sequences,   illustrating that it can perform quite well even on very unbalanced design. Computational complexities are also addressed.

Keywords: Phylogenetic Tree; Hamming Distance; Dissimilarity Measures; Statistical Genetics; Non-parametric Test.




 

Copy of the file:

rp34-04.ps (Postscript)

rp34-04.ps.gz (gzipped postscriptF)

July 21, 2004

 

Volta ao indíce de Relatórios de Pesquisa