Lecture #8 | Tree Building

0.0(0)
studied byStudied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/32

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 2:01 AM on 2/20/25
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

33 Terms

1
New cards

Discrete characters

No overlapping variation

  • teeth, no teeth

Divided into binary or multistate

  • Binary (0 or 1)

  • Multistate ( >2 states, 0 or 1 or 2)

2
New cards

Continuous characters

Measurements

3
New cards

Multistate characters

May be ordered (linear) or unordered

4
New cards

How are numerical scores assigned?

Usually (not always), the character state considered to be most ancestral is given the lowest numerical value, with more derived states given an increasingly higher value (1,2…) depending on the type of character

  • usually assigned with an initial hypothesis of character change developed by the investigator

  • Subject to reevaluation

5
New cards

Character polarity

Assignment of character order

  • evolutionary history of a trait or feature of an organism

6
New cards

Outgroup method

A method to determine polarity

  • If a character has 2 or more states, the state found in the next most related group (outgroup) is ancestral

  • By far the most common tree rooting method

  • Best to have two outgroups because it defends against autapomorphy (unique change in character state that is not informative for relationships)

7
New cards

What is important when considering methods of analysis for a tree?

  1. Informative-organized

  2. Predictive-Provides information that is fundamental to the relationships

  3. Stable- stable to new info from new taxa and data

  4. Operational- based on a set of procedures that can be accepted and refuted

8
New cards

Newick Format

Method to represent graphical trees, with or without branch lengths, using parentheses, commas, and a semicolon

9
New cards

How to construct a newick tree

  1. Identify internal nodes

  2. Add a set of parentheses for each internal node and a comma between the left and right descendants of each nodes. Add a semicolon to the end of the Newick tree

  3. Each branch length is preceded by a colon (;)

  4. Branch lengths (with their preceding :) are placed after the taxon names and after right parentheses (except the last one)

  • Binomial names require apostrophes or underscores to link together

10
New cards

Unrooted trees

Lack temporal polarization

  • point of common ancestry is missing

11
New cards

Rooted trees

Temporally polarized

  • point of common interest is given

12
New cards

Determining unrooted number of taxa

where n = number of taxa

13
New cards

Determining number of rooted trees

14
New cards

Gene duplications root

Paralogous gene duplications that predate the common ancestor of a taxonomic group are used to root the tree

  • root is placed between paralogous gene copies

15
New cards

Midpoint rooting

Tree is rooted on the midpoint between the two most distant leaves

  • choose the midpoint between the two most distant external nodes

  • assumes the rate of evolution is the same on the longest branches of the tree

16
New cards

Desirable properties of tree building methods

  • Consistency: will the method converge on the correct solution given enough data

  • Efficiency: How fast is the method

  • Power: How much data is needed for a reasonable result

  • Robustness: Will minor violations of the assumptions result in poor estimates of phylogeny

17
New cards

Types of data: Discrete versus Continuous

Discrete data is more common-few methods can handle continuous data

18
New cards

Types of data: Character versus Distance Data

Important that character comparisons between taxa can be used to develop distance matches, but reverse cannot happen

19
New cards

Types of data: Correct versus uncorrected data

Morphological characters may be standardized so that they all have equal value in an analysis

20
New cards

Cluster analysis

The recognition of groups of individuals on the basis of multiple characters. Groups may be mutually exclusive, hierarchic, or partially overlapping

21
New cards

Phonetics

classification based on numerous precisely delimited characters of equal weight and their comparison by an explicit method of grouping

22
New cards

Key points of cluster analysis

  1. Objectivity

  2. Polythetic Taxa: Groups based on character combinations

  3. Many characters- use as many characters as possible

  4. Equal weighting-every character has equal weight

  5. Overall similarity: groups recognized on basis of overall similarity nothing else

  6. Defining character polarity is nor important

23
New cards

How to create a cluster analysis

  1. Select taxa that represents both the entire geographical range and the entire morphological range of variation

  2. Select characters: As many characters as possible should be chosen. Each character gives equal weight to the determination of overall similarity

  3. Calculation of similarity/dissimilarity matrix

  4. Grouping OTUs by single linking (nearest neighbor)

24
New cards

Advantaged of cluster methods for determining relationships

  1. Operational: clearly defined procedures

  2. Communicable: anyone can code for characters and produce a classification without prior knowledge

  3. No weighting or preference for certain characters

25
New cards

Problems of cluster methods

Relationships depicted are strongly affected by

  1. choice of characters

  2. number of taxa

  3. type of similarity coefficient

  4. Clustering technique applied to similarity matrix

  5. higher categories are subjective

Groupings are more technique dependent providing an artificial grouping of taxa rather than moving towards a system of uncovering stable relationships

26
New cards

How often are phenetic (cluster) methods used?

Rarely do studies that utilize morphological characters ever use phenetic methods.

  • However, phenetics are still used for relatively simple organisms like prokaryotes

27
New cards

What assumptions allow for use of distance methods for molecular data?

  1. Molecular clock is assumed whereby mutations at any particular site in the genome are random and occur with equal frequency over time

  2. most changes are observed

  3. Character system is enormous with a potential to use the entire genome for analysis

  4. Changes in the genome are expected to be independent of environmental or selective pressure and less subject to convergence

But we know

  1. There is a preference for transitions

  2. not all positions of codons change at equal rates

  3. different regions of the genome,e are more conservative

28
New cards

Morphometrics

The quantitative description, analysis, and interpretation of shape and shape variation in biology

29
New cards

Single linkage clustering method

30
New cards

UPGMA- Unweighted pair group mean average

31
New cards

Neighbor-Joining Methods

  • Widely used for constructing phylogenetic trees with molecular data

  • Can be applied to the data for the corrected matrix conversion

  • Assumes additivity, not ultrametricity, so all branch length divergencies are not necessarily equal

    • Branch lengths in the matrix and the tree path length match perfectly and there is a single and unique additive tree that fits the distance matrix

32
New cards

Advantaged of NJ

  • Branch lengths are additive and reflect the true distances between taxa

  • Fast computational time

  • Can invoke outgrip rooting of the tree

  • Can empty various models of character state evolution to adjust branch lengths relationships

33
New cards

Disadvantages of NJ

  • not possible to infer or directly map character back onto topology

  • produces a single tree with no evaluation of competing hypotheses

  • can produce a quick and dirty tree that may be very different from OC method trees