Variability

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall with Kai
GameKnowt Play
New
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/9

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

10 Terms

1
New cards

Spread and Variability

  • Variability - used to determine how spread out a group of scores are, or if they are bunched together

    • Synonymous (nearly the same) with terms spread and dispersion

  • While central tendency describes the center of the distribution, variability describes the divergence (gap/separation) from the center

The scores are the same = no variability

A small difference between scores = small variability

A large difference between scores = large variability

<ul><li><p><strong>Variability</strong> - <u>used to determine</u> how <mark data-color="purple" style="background-color: purple; color: inherit">spread out a group of scores are</mark>, <em>or if they are</em> <mark data-color="purple" style="background-color: purple; color: inherit">bunched together</mark></p><ul><li><p>Synonymous (nearly the same) with terms spread and dispersion</p></li></ul></li><li><p>While central tendency describes the center of the distribution, <mark data-color="purple" style="background-color: purple; color: inherit">variability describes the divergence</mark> (<em>gap/separation</em>) <mark data-color="purple" style="background-color: purple; color: inherit">from the center</mark></p></li></ul><p>The <span style="color: green">scores</span> are the <span style="color: green">same</span> = <span style="color: green">no variability</span></p><p>A <span style="color: blue">small difference</span> between <strong>scores</strong> = <span style="color: blue">small variability</span></p><p>A <span style="color: red">large difference</span> between <strong>scores</strong> = <span style="color: red">large variability</span></p>
2
New cards

Measuring Variability

  • Can utilise a few difference measures to describe variability:

    • Describes if scores are clustered or spread over distance

    • informs us of how well a score represents the entire distribution

  • We do this through:

    • Range

    • Interquartile Range

    • Standard Deviation

3
New cards

Range

  • distance between the largest score (Xmax) and the smallest score (Xmin)

FORMULA)

Range = URL Xmax - LRL Xmax

URL = Upper Real Limit

LRL = Lower Real Limit

<ul><li><p><strong>distance between</strong> the <span style="color: blue">largest score</span> (<em>Xmax</em>) and the <span style="color: red">smallest score</span> (<em>Xmin</em>)</p></li></ul><p></p><p>FORMULA)</p><p>Range = URL <span style="color: blue">Xmax</span> - LRL <span style="color: red">Xmax</span></p><p></p><p>URL = Upper Real Limit</p><p>LRL = Lower Real Limit</p>
4
New cards

Interquartile Range (IQR)

  • The range of the middle 50% of the distribution

    FORMULA) IQR = Q3 - Q1

EXAMPLE)

The data set: 1, 2, 2, 3, 3, 4, 4, 4(8th), 5(9th), 5, 5, 5, 6, 6, 6, 7

Step 1: Find the Median (Q2)

There are 16 numbers

  • Middle position is between the 8th and 9th values

  • 8th value = 4, 9th value = 5

  • Median = (4 + 5) / 2 = 4.5

So the dataset is split into lower half (first 8 numbers) and upper half (last 8 numbers).

Step 2: Find Q1 (the median of the lower half)

Lower half: 1, 2, 2, 3(4th), 3(5th), 4, 4, 4

There are 8 numbers

  • Middle is between 4th and 5th values = (3 + 3) ÷ 2 = 3

So Q1 = 3

Step 3: Find Q3 (the median of the upper half)

Upper half: 5, 5, 5, 5(4th), 6(5th), 6, 6, 7

  • Middle is between 4th and 5th values = (5 + 6) ÷ 2 = 5.5

So Q3 = 5.5

Step 4: Calculate IQR

IQR = Q3Q1 = 5.53 = 2.5

<ul><li><p>The <strong>range of the middle 50%</strong> of the <u>distribution</u></p><p>FORMULA) <span style="color: yellow">IQR</span> = <span style="color: blue">Q3</span> <strong>-</strong> <span style="color: red">Q1</span></p></li></ul><p>EXAMPLE)</p><p>The data set: 1, 2, 2, 3, 3, 4, 4, <span style="color: green">4</span>(<span style="color: green">8th</span>), <span style="color: green">5</span>(<span style="color: green">9th</span>), 5, 5, 5, 6, 6, 6, 7</p><p><strong>Step 1: Find the Median (Q2)</strong></p><p>There are <strong>16 numbers</strong></p><ul><li><p><strong>Middle position</strong> is <em>between</em> the <span style="color: green">8th</span> and <span style="color: green">9th</span> values</p></li><li><p><span style="color: green">8th</span><strong> value</strong> = <span style="color: green">4</span>, <span style="color: green">9th</span> <strong>value</strong> = <span style="color: green">5</span></p></li><li><p><span style="color: purple">Median</span> = (<span style="color: green">4</span> + <span style="color: green">5</span>) <span style="color: purple">/ 2</span> = <span style="color: purple">4.5</span></p></li></ul><p>So the <strong>dataset</strong> is <em>split into</em> <span style="color: red">lower half</span> (<u>first 8 numbers</u>) and <span style="color: blue">upper half</span> (<u>last 8 numbers</u>).</p><p></p><p><strong>Step 2: Find </strong><span style="color: red">Q1</span><strong> (the median of the lower half)</strong></p><p><span style="color: red">Lower half</span>: 1, 2, 2, <span style="color: green">3</span>(<span style="color: green">4th</span>), <span style="color: green">3</span>(<span style="color: green">5th</span>), 4, 4, 4</p><p>There are <strong>8 numbers</strong></p><ul><li><p><strong>Middle</strong> is <em>between</em> <span style="color: green">4th</span> and <span style="color: green">5th</span> <strong>values</strong> = (<span style="color: green">3</span> + <span style="color: green">3</span>) <span style="color: purple">÷ 2</span> = <span style="color: red">3</span></p></li></ul><p>So <span style="color: red">Q1</span> = <span style="color: red">3</span></p><p></p><p><strong>Step 3: Find </strong><span style="color: blue">Q3</span><strong> (the median of the upper half)</strong></p><p><span style="color: blue">Upper half</span>: 5, 5, 5, <span style="color: green">5</span>(<span style="color: green">4th</span>), <span style="color: green">6</span>(<span style="color: green">5th</span>), 6, 6, 7</p><ul><li><p><strong>Middle</strong> is <em>between</em> <span style="color: green">4th</span> and <span style="color: green">5th</span> <strong>values</strong> = (<span style="color: green">5</span> + <span style="color: green">6</span>) <span style="color: purple">÷ 2</span> = <span style="color: blue">5.5</span></p></li></ul><p>So <span style="color: blue">Q3</span> = <span style="color: blue">5.5</span></p><p></p><p><strong>Step 4: Calculate IQR</strong></p><p><span style="color: yellow">IQR</span> = <span style="color: blue">Q3</span> – <span style="color: red">Q1</span> = <span style="color: blue">5.5</span> – <span style="color: red">3</span> = <span style="color: yellow">2.5</span></p>
5
New cards

Semi-Interquartile Range

  • Half of the interquartile range

    • Semi-interquartile range = (Q3 - Q1)/2

      Semi-interquartile range = (5.5 -3)/2

  • Most stable measure of variability

    • Less likely to be influenced by extreme scores; does not give a complete picture of the variability for the entire set

6
New cards

Standard Deviation and Variance 𝜎² / 𝜎: Population

  1. Determine the distance or deviation of each score from the mean

    • FORMULA) Deviation score = X - μ

  2. Square each deviation score: (X - μ)²

  3. Sum of the Squared Deviations(SS): SS = ∑(X - μ)²

    • The average of these scores is the population variance

    • The variance is the average squared distance from the mean

  4. Square root the variance

    • Correction for having all the squared distances

<ol><li><p><strong>Determine</strong> the <mark data-color="purple" style="background-color: purple; color: inherit">distance or deviation</mark> of <u>each score from the mean</u></p><ul><li><p>FORMULA) <strong>Deviation score</strong> = X - μ</p></li></ul></li><li><p><strong>Square</strong> <u>each deviation score</u>: (X - μ)²</p></li><li><p><strong>Sum</strong> of the <u>Squared Deviations</u>(<em>SS</em>): SS = ∑(X - μ)²</p><ul><li><p><strong>The average</strong> of <em>these scores</em> is the <mark data-color="purple" style="background-color: purple; color: inherit">population variance</mark></p></li><li><p><strong>The variance</strong> is the <mark data-color="purple" style="background-color: purple; color: inherit">average squared distance from the mean</mark></p></li></ul></li><li><p><strong>Square root the variance</strong></p><ul><li><p><strong>Correction</strong> for <em>having all the </em><mark data-color="purple" style="background-color: purple; color: inherit">squared distances</mark></p></li></ul></li></ol>
7
New cards

Standard Deviation and Variance s² / s: Sample

  • Bias: Sample variability is always smaller than population variability

  • Have to adjust for this bias

    • For the mean: Use instead of μ

    • Variability: Computation steps for the numerator are the same

      1. Find the deviation for each score: X - X̄

      2. Square each deviation: (X - X̄)²

      3. Add the squared deviations: 𝑆𝑆 = ∑(X - X̄)²

  • 𝑆𝑆 = ∑(X - X̄)² is the same computation as for the population

  • Adding the correction for this bias in sample variability, the denominator becomes N-1

<ul><li><p><strong>Bias</strong>: <strong>Sample variability</strong> is <mark data-color="purple" style="background-color: purple; color: inherit;">always smaller than population variability</mark></p></li><li><p>Have to <strong>adjust for this bias</strong></p><ul><li><p>For the <strong>mean</strong>: Use <strong>X̄</strong> <u>instead of</u> <em>μ</em></p></li><li><p><strong>Variability</strong>: Computation steps for the numerator are the same</p><p>1. <strong>Find the deviation</strong> for <u>each score</u>: <strong>X - X̄</strong></p><p>2. <strong>Square</strong> <u>each deviation</u>: <strong>(X - X̄)²</strong></p><p>3. <strong>Add</strong> the <u>squared deviations</u>: <strong>𝑆𝑆 = ∑(X - X̄)²</strong></p></li></ul></li><li><p><strong>𝑆𝑆 = ∑(X - X̄)² </strong>is the <mark data-color="purple" style="background-color: purple; color: inherit;">same computation</mark> as for the <em>population</em></p></li><li><p><strong>Adding</strong> the <u>correction for this bias in sample variability</u>, the <mark data-color="purple" style="background-color: purple; color: inherit;">denominator becomes N-1</mark></p></li></ul><p></p>
8
New cards

Degrees of Freedom (df)

How many values can you choose freely before the last one is forced by the mean condition

Step 1: How many numbers do we have?

  • We have 3 numbers: 2, 4, 6

    • Find MEAN:

      FIRST ∑X: 2 + 4 + 6 = 12 (this is our sum)

      THEN DIVIDE ∑X (12) by N (3): 12/3 = 4

    • MEAN = 4

Step 3: How many values we can have free?

Imagine you have 3 boxes 🟦🟦🟦

  • You need to put numbers inside them

  • The rule is: the mean must equal 4

That’s because: MEAN formula needs to make sense

12 (the sum)/3 (the N) = 4

EXAMPLE)

  1. Pick the first number; Say you put 2 in the first box - Nothing stops you, totally free choice

  2. Pick the second number; Say you put 7 in the second box - Again, free choice

  3. Now what about the third box; Now the first two numbers already add up to 2 + 7 = 9, But the total has to equal 12

    • That means the third box is forced to be 12 − 9 = 3; (not free anymore).

Key idea:

  • Box 1 = free

  • Box 2 = free

  • Box 3 = locked (it must make the total = 12)

So even though you have 3 numbers, only 2 are “free to vary.”

That’s why the degrees of freedom = 2.

9
New cards

Standard Deviation: Importance

  1. Provides a measure of the typical or standard distance from the mean for a distribution

  2. Allows for interpretation of individual scores

  3. Common measure to describe a set of data

    • along with the mean

10
New cards

Standard Deviation: Transformation

  • Adding or Subtracting a constant to/from each score

    • Not change the standard deviation

      Consider a distribution with μ = 40 and σ = 10 ; Add 5 points to every score

  • Multiplying or Dividing each score by a constant

    • Causes the standard deviation to be multiplied or divided by the same constant

      Consider a distribution with μ = 40 and σ = 10 ; Multiply each score by 2