Fast estimation of the difference between two PAM/JTT evolutionary distances in triplets of homologous sequences
Gonnet, Gastón H.
- Journal Article
Rights / licenseCreative Commons Attribution 2.0 Generic
Background The estimation of the difference between two evolutionary distances within a triplet of homologs is a common operation that is used for example to determine which of two sequences is closer to a third one. The most accurate method is currently maximum likelihood over the entire triplet. However, this approach is relatively time consuming. Results We show that an alternative estimator, based on pairwise estimates and therefore much faster to compute, has almost the same statistical power as the maximum likelihood estimator. We also provide a numerical approximation for its variance, which could otherwise only be estimated through an expensive re-sampling approach such as bootstrapping. An extensive simulation demonstrates that the approximation delivers precise confidence intervals. To illustrate the possible applications of these results, we show how they improve the detection of asymmetric evolution, and the identification of the closest relative to a given sequence in a group of homologs. Conclusion The results presented in this paper constitute a basis for large-scale protein cross-comparisons of pairwise evolutionary distances. Show more
Journal / seriesBMC Bioinformatics
Pages / Article No.
SubjectGENANALYSE (GENETISCHE TECHNIKEN); GENOMIK (MOLEKULARE GENETIK); PHYLOGENESE (BIOLOGISCHE EVOLUTION); GENE ANALYSIS (GENETIC TECHNIQUES); GENOMICS (MOLECULAR GENETICS); PHYLOGENY (BIOLOGICAL EVOLUTION); Evolutionary Distance; Pairwise Distance; Close Homolog; Substitution Matrice; Accurate Confidence Interval
Organisational unit02644 - Institut für Wissenschaftliches Rechnen, direkt
03309 - Gonnet, Gaston
MoreShow all metadata