Hypothesis Testing
Phylogenetics
Hypothesis Testing
Presenter: Joe Ironside
A critical aspect of phylogenetics involving evaluation of proposed evolutionary trees, assessing their validity and reliability.
Robustness of Phylogenies
Essential Question: Is our tree robust?
Inquiry into the reliability of the constructed phylogenetic tree.
Exploration of confidence levels regarding the tree's reflection of the true phylogeny.
Consensus Trees
A tool to summarize a whole set of phylogenetic trees. Differentiated into types:
Strict Consensus Tree:
Clades must be present in all trees.
Semi-Strict Consensus Tree:
Clades must be present in at least one tree and not contradicted by any tree.
Majority Rule Consensus Tree:
Clades must be present in a specified proportion of trees.
Typically requires presence in at least 50% of trees.
Clades not meeting these criteria are collapsed, generating polytomies.
Examples include:
Mammals
Turtles
Lizards
Crocodilians
Birds
Bootstrapping
A technique to assess the reliability of phylogenetic trees:
Description: Pseudoreplication is employed where a new dataset is created by randomly sampling characters from the original dataset.
Procedure:
Phylogeny is reconstructed for each pseudoreplicate.
This process is repeated many times.
Majority Rule Consensus Tree is then constructed using optimal trees derived from all pseudoreplicates.
Bootstrap Support for a Branch:
Defined as the percentage of optimal trees in which the branch appears.
Results from bootstrapping may show supportive values:
Example: Leads to bootstrap values (e.g., Mammals: 83, Turtles: 34, Lizards: 98, Crocodilians: 47).
Monophyly
Monophyletic Group:
A group that includes all descendants of a common ancestor, thus depicting a complete lineage.
Hierarchical classification includes:
Pisces
Amphibia
Mammalia
Testudines
Lepidosauria
Crocodylia
Archosauria
Diapsida
Reptilia
Amniota
Tetrapoda
Vertebrata
Aves
Paraphyly
Paraphyletic Group:
A group that excludes one or more descendants of a common ancestor, which are placed in a separate category. Examples include:
Reptilia:
This group excludes birds.
Artiodactyla (Even-toed Ungulates):
This group excludes cetaceans (whales, dolphins).
Invertebrates:
This group excludes vertebrates.
Prokaryotes:
This group excludes eukaryotes.
Polyphyly
Polyphyletic Group:
A group that excludes the common ancestor, and some of its descendants may also be excluded.
Examples:
Warm-blooded vertebrates exclude the common ancestor of birds and mammals, as well as all reptilian descendants of this common ancestor.
Comparison of Group Types
Monophyly: All descendants from a common ancestor included.
Paraphyly: Excludes one or more descendants from a common ancestor, thus not fully inclusive.
Polyphyly: Excludes the common ancestor itself, leading to a more unrelated grouping.
Additional Hypothesis Testing
Fundamental Question: Is one tree significantly better than another?
Comparing Two Trees
Involves specifying two or more trees a priori without utilizing data for a tree search.
Utilizes the Kishino-Hasegawa (KH) test for this comparison and evaluation.
Using Constrained Trees
Allows for hypothesis testing even without specifying trees a priori:
Procedure:
A constrained tree is generated by specifying constraints on the analysis.
A tree search is performed utilizing parsimony, likelihood, or Bayesian criteria.
Only trees meeting the specified constraints are retained.
Example:
Constrained Tree: Birds and crocodilians form a monophyletic group.
Unconstrained Tree: Birds and crocodilians may not form a monophyletic group.
Hypothesis Testing Methods
Parsimony Testing
With Constraint Enforced: Produces the most parsimonious tree(s) that meet the constraints.
Without Constraint: Produces the most parsimonious tree(s) irrespective of constraints.
Tests: Check for significant differences in parsimony using:
Templeton Test
Winning Sites Test
Likelihood Testing
With Constraint Enforced: Generates likeliest tree(s) that fulfill constraints.
Without Constraint: Generates likeliest tree(s) independent of constraints.
Tests: Assess significant differences in likelihood using:
Topology-based Tests
Shimodaira-Hasegawa (SH) Test
Approximately Unbiased (AU) Test
Bayesian Testing
Involves Markov Chain Monte Carlo (MCMC) Simulation:
Yields a set of best trees at a state of stationarity.
Within this set, identifies the number of trees that are consistent with the constraint tree.
Computes posterior probability of the hypothesis by dividing consistent tree counts by total tree counts.
Phylogenetic Comparative Methods
Multispecies Comparative Studies:
Design to test hypotheses by comparing traits across different species.
Example: Investigating if polygynous bird species tend to exhibit bright male coloration.
Addressing Phylogenetic Confounding
Acknowledge the confounding effects of phylogeny, as species may not be independent data points due to similarities resulting from recent common ancestry.
Experimental Design:
Should mitigate this confounding effect, specifically avoiding phylogenetic pseudoreplication (e.g., comparing polygynous pheasants with monogamous geese).
Phylogenetically Independent Contrasts (PICs)
A methodological approach designed to account for phylogenetic relationships, thus correcting for the influence of phylogeny in statistical analyses.