Variability

0.0(0)

Studied by 0 people

0.0(0)

Call with Kai

Knowt Play

New

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/9

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

10 Terms

New cards

Spread and Variability

Variability - used to determine how spread out a group of scores are, or if they are bunched together
- Synonymous (nearly the same) with terms spread and dispersion
While central tendency describes the center of the distribution, variability describes the divergence (gap/separation) from the center

The scores are the same = no variability

A small difference between scores = small variability

A large difference between scores = large variability

<ul><li><p><strong>Variability</strong> - <u>used to determine</u> how <mark data-color="purple" style="background-color: purple; color: inherit">spread out a group of scores are</mark>, <em>or if they are</em> <mark data-color="purple" style="background-color: purple; color: inherit">bunched together</mark></p><ul><li><p>Synonymous (nearly the same) with terms spread and dispersion</p></li></ul></li><li><p>While central tendency describes the center of the distribution, <mark data-color="purple" style="background-color: purple; color: inherit">variability describes the divergence</mark> (<em>gap/separation</em>) <mark data-color="purple" style="background-color: purple; color: inherit">from the center</mark></p></li></ul><p>The <span style="color: green">scores</span> are the <span style="color: green">same</span> = <span style="color: green">no variability</span></p><p>A <span style="color: blue">small difference</span> between <strong>scores</strong> = <span style="color: blue">small variability</span></p><p>A <span style="color: red">large difference</span> between <strong>scores</strong> = <span style="color: red">large variability</span></p>

New cards

Measuring Variability

Can utilise a few difference measures to describe variability:
- Describes if scores are clustered or spread over distance
- informs us of how well a score represents the entire distribution
We do this through:
- Range
- Interquartile Range
- Standard Deviation

New cards

Range

distance between the largest score (Xmax) and the smallest score (Xmin)

FORMULA)

Range = URL Xmax - LRL Xmax

URL = Upper Real Limit

LRL = Lower Real Limit

<ul><li><p><strong>distance between</strong> the <span style="color: blue">largest score</span> (<em>Xmax</em>) and the <span style="color: red">smallest score</span> (<em>Xmin</em>)</p></li></ul><p></p><p>FORMULA)</p><p>Range = URL <span style="color: blue">Xmax</span> - LRL <span style="color: red">Xmax</span></p><p></p><p>URL = Upper Real Limit</p><p>LRL = Lower Real Limit</p>

New cards

Interquartile Range (IQR)

The range of the middle 50% of the distribution
FORMULA) IQR = Q3 - Q1

EXAMPLE)

The data set: 1, 2, 2, 3, 3, 4, 4, 4(8th), 5(9th), 5, 5, 5, 6, 6, 6, 7

Step 1: Find the Median (Q2)

There are 16 numbers

Middle position is between the 8th and 9th values
8th value = 4, 9th value = 5
Median = (4 + 5) / 2 = 4.5

So the dataset is split into lower half (first 8 numbers) and upper half (last 8 numbers).

Step 2: Find Q1 (the median of the lower half)

Lower half: 1, 2, 2, 3(4th), 3(5th), 4, 4, 4

There are 8 numbers

Middle is between 4th and 5th values = (3 + 3) ÷ 2 = 3

So Q1 = 3

Step 3: Find Q3 (the median of the upper half)

Upper half: 5, 5, 5, 5(4th), 6(5th), 6, 6, 7

Middle is between 4th and 5th values = (5 + 6) ÷ 2 = 5.5

So Q3 = 5.5

Step 4: Calculate IQR

IQR = Q3 – Q1 = 5.5 – 3 = 2.5

<ul><li><p>The <strong>range of the middle 50%</strong> of the <u>distribution</u></p><p>FORMULA) <span style="color: yellow">IQR</span> = <span style="color: blue">Q3</span> <strong>-</strong> <span style="color: red">Q1</span></p></li></ul><p>EXAMPLE)</p><p>The data set: 1, 2, 2, 3, 3, 4, 4, <span style="color: green">4</span>(<span style="color: green">8th</span>), <span style="color: green">5</span>(<span style="color: green">9th</span>), 5, 5, 5, 6, 6, 6, 7</p><p><strong>Step 1: Find the Median (Q2)</strong></p><p>There are <strong>16 numbers</strong></p><ul><li><p><strong>Middle position</strong> is <em>between</em> the <span style="color: green">8th</span> and <span style="color: green">9th</span> values</p></li><li><p><span style="color: green">8th</span><strong> value</strong> = <span style="color: green">4</span>, <span style="color: green">9th</span> <strong>value</strong> = <span style="color: green">5</span></p></li><li><p><span style="color: purple">Median</span> = (<span style="color: green">4</span> + <span style="color: green">5</span>) <span style="color: purple">/ 2</span> = <span style="color: purple">4.5</span></p></li></ul><p>So the <strong>dataset</strong> is <em>split into</em> <span style="color: red">lower half</span> (<u>first 8 numbers</u>) and <span style="color: blue">upper half</span> (<u>last 8 numbers</u>).</p><p></p><p><strong>Step 2: Find </strong><span style="color: red">Q1</span><strong> (the median of the lower half)</strong></p><p><span style="color: red">Lower half</span>: 1, 2, 2, <span style="color: green">3</span>(<span style="color: green">4th</span>), <span style="color: green">3</span>(<span style="color: green">5th</span>), 4, 4, 4</p><p>There are <strong>8 numbers</strong></p><ul><li><p><strong>Middle</strong> is <em>between</em> <span style="color: green">4th</span> and <span style="color: green">5th</span> <strong>values</strong> = (<span style="color: green">3</span> + <span style="color: green">3</span>) <span style="color: purple">÷ 2</span> = <span style="color: red">3</span></p></li></ul><p>So <span style="color: red">Q1</span> = <span style="color: red">3</span></p><p></p><p><strong>Step 3: Find </strong><span style="color: blue">Q3</span><strong> (the median of the upper half)</strong></p><p><span style="color: blue">Upper half</span>: 5, 5, 5, <span style="color: green">5</span>(<span style="color: green">4th</span>), <span style="color: green">6</span>(<span style="color: green">5th</span>), 6, 6, 7</p><ul><li><p><strong>Middle</strong> is <em>between</em> <span style="color: green">4th</span> and <span style="color: green">5th</span> <strong>values</strong> = (<span style="color: green">5</span> + <span style="color: green">6</span>) <span style="color: purple">÷ 2</span> = <span style="color: blue">5.5</span></p></li></ul><p>So <span style="color: blue">Q3</span> = <span style="color: blue">5.5</span></p><p></p><p><strong>Step 4: Calculate IQR</strong></p><p><span style="color: yellow">IQR</span> = <span style="color: blue">Q3</span> – <span style="color: red">Q1</span> = <span style="color: blue">5.5</span> – <span style="color: red">3</span> = <span style="color: yellow">2.5</span></p>

New cards

Semi-Interquartile Range

Half of the interquartile range
- Semi-interquartile range = (Q3 - Q1)/2
  Semi-interquartile range = (5.5 -3)/2
Most stable measure of variability
- Less likely to be influenced by extreme scores; does not give a complete picture of the variability for the entire set

New cards

Standard Deviation and Variance 𝜎² / 𝜎: Population

Determine the distance or deviation of each score from the mean
- FORMULA) Deviation score = X - μ
Square each deviation score: (X - μ)²
Sum of the Squared Deviations(SS): SS = ∑(X - μ)²
- The average of these scores is the population variance
- The variance is the average squared distance from the mean
Square root the variance
- Correction for having all the squared distances

<ol><li><p><strong>Determine</strong> the <mark data-color="purple" style="background-color: purple; color: inherit">distance or deviation</mark> of <u>each score from the mean</u></p><ul><li><p>FORMULA) <strong>Deviation score</strong> = X - μ</p></li></ul></li><li><p><strong>Square</strong> <u>each deviation score</u>: (X - μ)²</p></li><li><p><strong>Sum</strong> of the <u>Squared Deviations</u>(<em>SS</em>): SS = ∑(X - μ)²</p><ul><li><p><strong>The average</strong> of <em>these scores</em> is the <mark data-color="purple" style="background-color: purple; color: inherit">population variance</mark></p></li><li><p><strong>The variance</strong> is the <mark data-color="purple" style="background-color: purple; color: inherit">average squared distance from the mean</mark></p></li></ul></li><li><p><strong>Square root the variance</strong></p><ul><li><p><strong>Correction</strong> for <em>having all the </em><mark data-color="purple" style="background-color: purple; color: inherit">squared distances</mark></p></li></ul></li></ol>

New cards

Standard Deviation and Variance s² / s: Sample

Bias: Sample variability is always smaller than population variability
Have to adjust for this bias
- For the mean: Use X̄ instead of μ
- Variability: Computation steps for the numerator are the same
  1. Find the deviation for each score: X - X̄
  2. Square each deviation: (X - X̄)²
  3. Add the squared deviations: 𝑆𝑆 = ∑(X - X̄)²
𝑆𝑆 = ∑(X - X̄)² is the same computation as for the population
Adding the correction for this bias in sample variability, the denominator becomes N-1

<ul><li><p><strong>Bias</strong>: <strong>Sample variability</strong> is <mark data-color="purple" style="background-color: purple; color: inherit;">always smaller than population variability</mark></p></li><li><p>Have to <strong>adjust for this bias</strong></p><ul><li><p>For the <strong>mean</strong>: Use <strong>X̄</strong> <u>instead of</u> <em>μ</em></p></li><li><p><strong>Variability</strong>: Computation steps for the numerator are the same</p><p>1. <strong>Find the deviation</strong> for <u>each score</u>: <strong>X - X̄</strong></p><p>2. <strong>Square</strong> <u>each deviation</u>: <strong>(X - X̄)²</strong></p><p>3. <strong>Add</strong> the <u>squared deviations</u>: <strong>𝑆𝑆 = ∑(X - X̄)²</strong></p></li></ul></li><li><p><strong>𝑆𝑆 = ∑(X - X̄)² </strong>is the <mark data-color="purple" style="background-color: purple; color: inherit;">same computation</mark> as for the <em>population</em></p></li><li><p><strong>Adding</strong> the <u>correction for this bias in sample variability</u>, the <mark data-color="purple" style="background-color: purple; color: inherit;">denominator becomes N-1</mark></p></li></ul><p></p>

New cards

Degrees of Freedom (df)

How many values can you choose freely before the last one is forced by the mean condition

Step 1: How many numbers do we have?

We have 3 numbers: 2, 4, 6
- Find MEAN:
  FIRST ∑X: 2 + 4 + 6 = 12 (this is our sum)
  THEN DIVIDE ∑X (12) by N (3): 12/3 = 4
- MEAN = 4

Step 3: How many values we can have free?

Imagine you have 3 boxes 🟦🟦🟦

You need to put numbers inside them
The rule is: the mean must equal 4

That’s because: MEAN formula needs to make sense

12 (the sum)/3 (the N) = 4

EXAMPLE)

Pick the first number; Say you put 2 in the first box - Nothing stops you, totally free choice
Pick the second number; Say you put 7 in the second box - Again, free choice
Now what about the third box; Now the first two numbers already add up to 2 + 7 = 9, But the total has to equal 12 …
- That means the third box is forced to be 12 − 9 = 3; (not free anymore).

Key idea:

Box 1 = free
Box 2 = free
Box 3 = locked (it must make the total = 12)

So even though you have 3 numbers, only 2 are “free to vary.”

That’s why the degrees of freedom = 2.

New cards

Standard Deviation: Importance

Provides a measure of the typical or standard distance from the mean for a distribution
Allows for interpretation of individual scores
Common measure to describe a set of data
- along with the mean

New cards

Standard Deviation: Transformation

Adding or Subtracting a constant to/from each score
- Not change the standard deviation
  Consider a distribution with μ = 40 and σ = 10 ; Add 5 points to every score

Multiplying or Dividing each score by a constant
- Causes the standard deviation to be multiplied or divided by the same constant
  Consider a distribution with μ = 40 and σ = 10 ; Multiply each score by 2