Statistics 1 - Final

0.0(0)

Studied by 0 people

0.0(0)

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/29

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

30 Terms

New cards

Describe the shape, center, and spread of the sampling distribution of the sample mean when the distribution is normal.

Shape: normal regardless of sample size

Center: mean (xbar) = mean of population

Spread: standard deviation(xbar) = standard deviation/root(n)

New cards

Describe the shape, center, and spread of the sampling distribution of the sample mean when the distribution is not normal.

Shape: as the sample size increases, the distribution of the sample mean becomes normal

Center: mean (xbar) = mean of population

Spread: standard deviation(xbar) = standard deviation/root(n)

New cards

Central limit theorem

Regardless of the shape of the underlying population, the sampling distribution of xbar becomes approximately normal as the sample size, n, increases (The distribution of the sample mean is normal when n >= 30)

New cards

Describe the shape, center, and spread of the sample proportion (p-hat)

Shape: as sample size increases, shape becomes normal provided np(1-p) >= 10

Center: mean of the distribution of the sample proportion = population proportion (p)

Spread: as the sample size increases, the standard deviation decreases

New cards

Confidence interval

a range of values that is likely to contain a population parameter, providing an estimate of its uncertainty

New cards

Level of confidence

the percentage of times a confidence interval will contain the true population parameter if the study were repeated many times

New cards

Margin of error

the range of uncertainty or variability around an estimate or measurement

New cards

Critical value

a threshold in statistics that helps determine if your results are statistically significant

New cards

T-distribution

number of standard deviations away from the middle of a sample mean when the population standard deviation is unknown and the sample size is small

New cards

What are the 6 rules of a t-distribution?

The t-distribution is different for different degrees of freedom
Centered around 0 and symmetric about 0
Area under the curve is 1. The area under the curve to the right of 0 equals the area under the curve to the left of 0, which equals ½.
As t increases or decreases without bound, the graph approaches, but never equals zero
The area in the tails of the t-distribution is a little greater than the area in the tails of the standard normal distribution, because we are using s as an estimate of σ, thereby introducing further variability into the t-statistic
As the sample size n increases, the density curve of t gets closer to the standard normal density curve. This result occurs because, as the sample size n increases, the values of s get closer to the values of σ, by the Law of Large Numbers

New cards

t-interval

a statistical tool that estimates a range of likely values for an unknown population mean, especially when the sample size is small and the population's standard deviation is unknown

New cards

What is the criteria for a t-interval

the sample is random
the sample is small relative to the population size (n<=0.05N)
is normally distributed
does not have outliers
sample size is <30

New cards

Chi-Squared test

a test used to determine if there is a significant difference between observed and expected frequencies for categorical data

New cards

Criteria for a chi-squared test

Not symmetric
Depends on degrees of freedom
As the degree of freedom increases, the chi-square becomes nearly symmetric
Always positive

New cards

What are the 4 outcomes of hypothesis testing?

Reject the null hypothesis when the alternative hypothesis is true. This decision would be correct.
Do not reject the null hypothesis when the null hypothesis is true. This decision would be correct
Reject the null hypothesis when the null hypothesis is true. This decision would be incorrect. This type of error is called a Type I error
Do not reject the null hypothesis when the alternative hypothesis is true. This decision would be incorrect. This type of error is called a Type II error.

New cards

Null hypothesis (H0)

no change/effect/difference, assumed true until evidence indicates otherwise

New cards

Alternative hypothesis (H1)

alternative to the null hypothesis

New cards

What are the 3 tests for a null hypothesis?

Equal vs unequal hypothesis (two-tailed)

H0 = some value

H1 != some value

Equal vs less-than (left-tailed)
H0 = some value
H1<some value
Equal vs greater than (right-tailed)

H0 = some value

H1 > some value

New cards

Level of signficance (alpha)

the probability of making a Type I error

New cards

How do you calculate the standard deviation of a sample proportion

Use formula: standard deviation = root(p * (1-p)/n)

New cards

How do you calculate a point estimate with an upper and lower bound?

Point estimate = (lower bound + upper bound)/2

New cards

How do you calculate a confidence interval?

Calculate sample proportion (p-hat) = x/n
Determine critical value (z a/2) for given confidence level -> a/2 = (1-confidence level)/2 -> find z-score such that the area to the right is a/2
Calculate the margin of error: E = z(a/2) * root(p-hat x (1-p)/n)
Calculate lower bound -> lower bound = p-hat - E
Calculate the upper bound -> upper bound = p-hat + E

New cards

How do you calculate the level of confidence?

Level = (1-alpha) x 100%, where alpha is the error (or what is outside of that interval

New cards

How do you calculate the margin of error with upper/lower bounds?

Margin of error = (Upper bound - lower bound)/2

New cards

How do you calculate how large a sample needs to be?

Calculate p-hat: (p-hat) = x/n
Find Z(a/2) such that a/2 = (1-confidence level)/2
Calculate the margin of error (use either formula depending on what’s given)
Plug all values into the formula: n = (p-hat * (1 - p-hat))(z(a/2)/E)²

New cards

How do you calculate a T-distribution?

t = (x-bar - mean)/(s/root(n))

New cards

How do you construct a T-interval?

Determine degrees of freedom: df = n-1
Calculate a/2: a/2 = (1 - confidence interval)/2
Find t-value for df and a/2: use the table
Calculate margin of error: E = t-value x (s/root(n))
Calculate lower bound: x-bar - E
Calculate upper bound: x-bar + E

New cards

How do you calculate the margin of error for a t-interval?

Find t(a/2) where a/2 = (1-confidence level)/2
Plug into the formula: E = t(a/2) x (s/root(n))

New cards

How do you calculate a chi-squared value?

X² = ((n-1)s²)/(standard deviation)²

New cards

How do you construct a confidence interval for (standard deviation)²?

Determine degree of freedom: df = n-1
Find lower tail chi-squared critical value: a/2 = (1-confidence level)/2
Find upper tail chi-squared critical value: 1 - (a/2)
Find chi-squared critical values: use the table for lower and upper tails
Calculate the lower bound of the confidence interval: ((n-1)s²)/(X(a/2)²
Calculate the upper bound of the confidence interval: ((n-1)s²)/(X(1 - a/2)²
If asking for standard deviation, square root results