Lecture 7- Describing variation & central tendency

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/29

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 3:22 PM on 3/29/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

30 Terms

1
New cards

What are the traits called that you visible and are either present or not present?

Qualitative, Discontinuous 

Qualitative: Physical Trait

Discontinuous: Exists are 2 or more distinct categories (no in between)

2
New cards

What are some examples of qualitative, discontinuous traits? 

Attached vs detached earlobe, widows peak or no widows peak

3
New cards

What are the scale of measurements commonly used in statistics?

  • Nominal: Qualitative characteristics w/o value, order, or rank (ex: male v. female)

  • Ordinal: Describes categories but difference in rank exists (ex: poor, average, good) 

  • Interval: Numerical data with inherent ordering (300-850 credit score- spaces between the numbers equal, but 0 value is arbitrary, 0 does not mean nothing exists) 

  • Ratio Data: Ordered and numerical distance is rank and can be measured (weight, height, length) (0 means weightless)

4
New cards

Which measurement scales are continuous and which are discontinuous? 

Interval and ratio are continuous and nominal and ordinal are discontinuous (categorical).

5
New cards

Which scale of measurement is used to determine qualitative, discontinuous attributes? 

Nominal: categories don't have inherent rank and they are qualitative! 

6
New cards

What kind of scale of measurement can height be described under? 

Quantitative and continuous (numerical with in between values existing) 

7
New cards

What is the central limit theorem? 

If N is large enough (i.e., ≥ 30), then 

  1. Mean of the sample of measurements (height) should approximate the mean of the population 

  2. Distribution of measurements should be normal (clustered around the mean with no bias to the left or right

<p><span style="background-color: transparent;">If N is large enough (i.e., ≥ 30), then&nbsp;</span></p><ol><li><p><span style="background-color: transparent;">Mean of the sample of measurements (height) should approximate the mean of the population&nbsp;</span></p></li><li><p><span style="background-color: transparent;">Distribution of measurements should be normal (clustered around the mean with no bias to the left or right</span></p></li></ol><p></p>
8
New cards

What is often the case when the n value is large? 

The larger the n value, the better a scatter plot can approximate a normal distribution

<p><span style="background-color: transparent;">The larger the n value, the better a scatter plot can approximate a normal distribution</span></p>
9
New cards

What is the mean, median, and mode referred to as? 

Measures of central tendency

10
New cards

What happens to the mean, median, and mode when your frequency distribution is normally distributed

The mean, median, and mode will be similar

<p><span style="background-color: transparent;">The mean, median, and mode will be similar</span></p>
11
New cards

How do you calculate mean, median, and mode

Mean: Sum of numbers / total numbers

Median: Sort the numbers from smallest to largest and find the middle number (if two numbers in the middle, find the average of the two)

Mode: Value that appears the most frequently in a data set

12
New cards

What is the Law of Large Numbers?

  • As you increase n, parameters describing the sample (mean) are less likely to be affected by random variables.

  • Fluctuations in the data and outliers will cancel out if you have a larger n value

  • Shows that importance of having a large sample size

13
New cards

What is the measure of central tendency that is less vulnerable to outliers?

The median- even if an outlier value exists, the value of the median will remain the same. The mean is at risk of changing because of the outliers.

14
New cards

When is the mean a valid measure of central tendency?

  • 1) Distribution is normal (symmetrical, bell shaped)

  • 2) Median and mode are roughly the same

<ul><li><p>1) Distribution is  normal (symmetrical, bell shaped) </p></li><li><p>2) Median and mode are roughly the same </p></li></ul><p></p>
15
New cards

How do you find residual variation of the mean?

1) (Reported- mean height of all reported) = residual

2) Residuals² = (residual)²

3) Sum of Squares= sum of all (residual)²

<p>1) (Reported- mean height of all reported) = residual</p><p>2) Residuals² = (residual)²</p><p>3) Sum of Squares= sum of all (residual)²</p>
16
New cards

How do you calculate population variance around the mean?

(σ²) = SS/ n (if you measure the ENTIRE population)

σ² = Population Variance

SS = Sum of Squares

n = population

<p>(<strong>σ²)</strong> = SS/ n (if you measure the ENTIRE population)</p><p>σ² = Population Variance</p><p>SS = Sum of Squares </p><p>n = population</p>
17
New cards

How do you measure sample variance around the mean?

= SS/ (n-1) {1 degree of freedom is taken away because you are making an ESTIMATE for the entire population}

s² = Sample Variance

SS = Sum of Squares

n = population

<p><strong>s² </strong>= SS/ (n-1) {1 degree of freedom is taken away because you are making an ESTIMATE for the entire population}</p><p>s² = Sample Variance</p><p>SS = Sum of Squares</p><p>n = population</p>
18
New cards

How do you calculate sample variance with an outlier in the data set?

Same as you would calculate it for a sample variance , just keep in mind that when n is small, the outliers can dramtically distort the estimates of variation.

<p>Same as you would calculate it for a sample variance , just keep in mind that when n is small, the outliers can dramtically distort the estimates of variation.</p>
19
New cards

What is the drawback to using S² as a measure of variation?

Squared units can be difficult to interpret, so the solution is to represent the variation in magnitude as SD.

SD= √s²

SD/ s = Standard Deviations

s² = Sample Variance

SD can give a measure of the noise in a given measurement

<p>Squared units can be difficult to interpret, so the solution is to represent the variation in magnitude as SD.</p><p>SD= √s²</p><p>SD/ s = Standard Deviations</p><p>s² = Sample Variance</p><p>SD can give a measure of the noise in a given measurement</p>
20
New cards

What are the characteristics of SE or SEM that make it more preferable?

1) SE is smaller than SD

2) SE decreases with increasingly sample size

21
New cards

What is the problem with both SE or SEM (standard error) and SD/s (standard deviation) and how can you fix it

They tend to increase with the size of the mean. You can fix this by dividing the the SD (or s) by the mean to get:

CV= ( s / X ) (100)

CV= Relative Variation

s= Standard Deviation

X= mean

<p>They tend to increase with the size of the mean. You can fix this by dividing the the SD (or s) by the mean to get:</p><p>CV= ( s / X ) (100)</p><p>CV= Relative Variation</p><p>s= Standard Deviation</p><p>X= mean</p>
22
New cards

What is another way to write Standard Deviation (SD)?

s (lowercase)

23
New cards

What is another way to write Standard Error (SE)

SEM (Standard Error of Means)

24
New cards

What is the confidence Interval (CI) and what are the benefits to using it?

Range of values that contain the true value of an unknown population parameter

Virtue: Bars can directly illustrate if the two means differ significantly

Downside: Bars are large and make data look noisy

25
New cards

How do you find 95% CI?

95% CI = (Z score) (standard error)

26
New cards

What are some commonly used z scores for confidence intervals?

95% CI = (Z score) (standard error)

27
New cards

Which means differ significantly from one another?

No overlap of error bars = significant difference

D with B, C, and E with A

  • A vs B → significant

  • A vs E → significant

  • D vs B → significant

  • D vs E → significant

28
New cards

Which outcomes differ significantly at the 0.05 level from 1?

The baseline is 1

  • Stroke is significant because it is above 1

  • Break Cancer is significant because it is above 1

  • Hip Fracture is significant because it is below 1

  • Endometrial Cancer is not significant because it crosses. 1 (no effect line)

29
New cards

What is used to represent data that lacks normal distribution of scores?

Box and Whisker Plot (numerical data with the median interquartiles and min max values)

30
New cards

What is a better way to represent data values?

Present the raw values, turning into a bar graph obscures that nature of the data. Also, presenting the responses for each experimental unit as is provides greater transparency.

<p>Present the raw values, turning into a bar graph obscures that nature of the data. Also, presenting the responses for each experimental unit as is provides greater transparency. </p>

Explore top notes

note
AP Gov: Chapter 1 Vocab
Updated 1299d ago
0.0(0)
note
Unit 7: Evolution (Biology)
Updated 710d ago
0.0(0)
note
Mitosis
Updated 751d ago
0.0(0)
note
diplomacy
Updated 1251d ago
0.0(0)
note
Le Chatelier's Principle
Updated 1158d ago
0.0(0)
note
AP Gov: Chapter 1 Vocab
Updated 1299d ago
0.0(0)
note
Unit 7: Evolution (Biology)
Updated 710d ago
0.0(0)
note
Mitosis
Updated 751d ago
0.0(0)
note
diplomacy
Updated 1251d ago
0.0(0)
note
Le Chatelier's Principle
Updated 1158d ago
0.0(0)

Explore top flashcards

flashcards
Kite Runners Vocab
35
Updated 68d ago
0.0(0)
flashcards
Midterm
238
Updated 389d ago
0.0(0)
flashcards
Describing People SPN2 PreAP
91
Updated 221d ago
0.0(0)
flashcards
Cellular Respiration Review
22
Updated 1215d ago
0.0(0)
flashcards
apush 3.1-3.4
52
Updated 578d ago
0.0(0)
flashcards
Chapter 9 Med Term
25
Updated 1217d ago
0.0(0)
flashcards
Posterior Muscles
47
Updated 1244d ago
0.0(0)
flashcards
SALUD Y BIENESTAR vocabulario
33
Updated 913d ago
0.0(0)
flashcards
Kite Runners Vocab
35
Updated 68d ago
0.0(0)
flashcards
Midterm
238
Updated 389d ago
0.0(0)
flashcards
Describing People SPN2 PreAP
91
Updated 221d ago
0.0(0)
flashcards
Cellular Respiration Review
22
Updated 1215d ago
0.0(0)
flashcards
apush 3.1-3.4
52
Updated 578d ago
0.0(0)
flashcards
Chapter 9 Med Term
25
Updated 1217d ago
0.0(0)
flashcards
Posterior Muscles
47
Updated 1244d ago
0.0(0)
flashcards
SALUD Y BIENESTAR vocabulario
33
Updated 913d ago
0.0(0)