Confidence Intervals (Part I)

0.0(0)

Studied by 0 people

0.0(0)

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/49

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

50 Terms

New cards

Normal Distribution

A continuous, symmetric, bell-shaped probability distribution in which values near the mean (μ) occur most frequently and values farther away become increasingly rare.

Fully described by its mean (μ) and standard deviation (σ)

New cards

f(x) = (1/(σ((2π)^1/2)))e^(-((x - μ)^2)/(2σ^2))

Equation for the probability density of a normal distribution.

New cards

Central Limit Theorem (CLT)

A fundamental statistical principle stating that, for a sufficiently large random sample, the distribution of the sample mean (X̄) becomes approximately normal, regardless of the shape of the population distribution.

The mean of this sampling distribution is μ, and its standard deviation (standard error) is σ_X̄ = (σ/((n)^1/2))

New cards

Confidence Interval

An interval around the sample mean that is likely to contains the true population mean.

It expresses how confident we are that μ lies within a given range.

New cards

Sample Mean (X̄)

The arithmetic mean of a random sample.

Under the Central Limit Theorem, it is approximately normally distributed with mean μ and standard deviation (σ_X̄) = (σ/((n)^1/2))

New cards

Fluctuate

(Verb)

Rise and fall irregularly in number or amount.

New cards

Standard deviation of the Sample Mean (σ_(X̄))

Measures how much the sample mean X̄ fluctuates from sample to sample.

New cards

(σ_(X̄)) = (σ/((n)^1/2))

Formula to calculate the Standard Deviation of the sample mean

New cards

95% Confidence Interval for μ

Covers μ for about 95% of all random samples.

The middle 95% of the normal distribution.

New cards

X̄ ± 1.96(σ_(X̄))

Formula for the 95% Confidence Interval for μ

New cards

68% Confidence Interval for μ

Covers μ for about 68% of all random samples; narrower but less reliable than the 95% Confidence Interval.

New cards

X̄ ± 1.00(σ_(X̄))

Formula for the 68% Confidence Interval for μ

New cards

Symbol that represents the population mean.

New cards

A 95% Confidence Interval means that if we repeated sampling many times, about 95% of those intervals would cover μ.

Correct interpretation of the confidence level

New cards

That there is a 95% probability that μ lies in this one interval.

Incorrect interpretation of the confidence level

New cards

Wider Interval (Less Precision)

Higher Confidence Level = ?

New cards

Narrower Interval (More Precision)

Lower Confidence Level = ?

New cards

95% Confidence Interval, 99.7% Confidence Interval.

68% Confidence Interval is narrower than ___________________ which is narrower than _____________________.

New cards

Tail of a Distribution

Refers to the extreme ends of the curve.

The far left and the far right parts

Where values are much smaller or much larger than the mean.

New cards

rare, unlikely outcomes

In a normal distribution, the tails represents ____ or ________________.

New cards

Left Tail

The tail of the normal distribution that contains very low values.

New cards

Right Tail

The tail of the normal distribution that contains very high values.

New cards

The significance level used in confidence interval calculations.

Represents the total area in both tails of the normal distribution that lies outside the confidence interval.

The probability of the interval failing to contain the true population mean (μ)

New cards

α/2

Half of the significance level.

Represents the area in one tail of the normal distribution beyond the confidence limits

New cards

z-score

A standardized value that indicates how many standard deviations a data point or sample mean is from the mean (μ) of a distribution.

New cards

z = ((x - μ)/(σ))

Formula for the z-score for individual values

New cards

z = (X̄ - μ)/(σ_(X̄))

Formula for the z-score of the sample means.

New cards

z_(α/2)

Critical z-score that cuts off an area of α/2 in the right hand tail of the standard normal curve.

It marks the point beyond which α/2 of the probability lies.

New cards

X̄ ± (z_(α/2))(σ_x) = X̄ ± (z_(α/2))(σ/(n)^1/2)

General formula for 100(1 - α)% confidence interval

New cards

Determining the confidence interval of a given interval

If CI = (x, y)

Solve

(y - x) = u

y - u - u = v

z_(α/2) = (y - v)/(σ/((n)^1/2) = z-score

Look at the z-score in the table and you shall get your final answer after taking that value and subtracting it by one.

New cards

reliability

Confidence level measures the ___________ of the method, not the probability that μ is in the interval.

“We are 90% confident that μ is in (14.73, 14.91)” ✅

“The probability that μ is between 14.73 and 14.91 is 90%” ❌

New cards

Higher Confidence (Wider Interval)

Less Precision = ?

New cards

Lower Confidence (Narrower Interval)

More Precision = ?

New cards

95% Confidence Level

Most common confidence level to balance both Higher confidence and lower confidence?

New cards

Concept of Repeated Confidence Intervals

While many confidence intervals are created (one for each rock)
Each interval estimates its rock’s true mean weight.
Even though the interval differ from sample to sample,
They collectively show how the confidence method behaves over many repetitions.

New cards

Yes, it is true

Is it true that,

A 95% confidence level means that if we repeatedly constructed confidence intervals using the same method, about 95% of them will include the true mean (μ). It will measure the reliability of the method and not the probability for a single interval.

New cards

Bernoulli trial, Binomial distribution

For modeling the confidence interval success, each interval can be viewed as a _______________, success if it cover μ, failure if it doesn’t.

When many intervals are created (like 250), the total number follows a _____________________.

New cards

normal distribution

When the number of trials is large, the binomial distribution behaves similarly to a _________________, making it easy to approximate probabilities for many confidence intervals at once.

New cards

confidence

By modeling and approximating, we can calculate the probability that a certain number of intervals will cover μ. This shows how often the __________ method is expected to succeed in practice.

New cards

One-sided Confidence Interval

Used when only one bound (lower or upper) is of interest.

New cards

X̄ - 1.645(σ_X̄)

Formula to calculate the 95% lower bound.

New cards

X̄ + 2.33(σ_X̄)

Formula to calculate the 99% upper bound.

New cards

A one sided 95% bound fails only on one side (5% tail), while a two-sided 95% interval can fail on either side (2.5% + 2.5%). Hence, the one-sided bound is higher.

Why is one-sided bound > two-sided lower limit

New cards

random sample

Formula of X̄ ± (z_(α/2))(σ_(X̄)) is only valid if data are a ____________

New cards

Cyclic Pattern Violation

The patten shown in the repeated process indicating that the sample is not random.

This pattern happens possibly because of the dependence between runs or periodic external effects.

Confidence Interval formula should not be used.

New cards

Trend Violation

Yields increase with time
Indicating that the sample is not random
This happens possibly because of the operator learning effect or due to time drift
In such cases Confidence Interval formula is invalid
Unless only the stable portion of the data is used.

New cards

Random Sample

Sample that is independent and identically distributed. Confidence Interval formula is valid to be used for such samples.

New cards

Non-random Sample

Sample where the time or process trend is present.

Using Confidence Interval formulas for such samples may misrepresent uncertainty.

New cards

Symbol that represents the sample standard deviation in a t-statistic.

New cards

t-score

Measures how many estimated standard errors a sample mean is from the population mean when the population standard deviation (σ) is unknown and the sample size is small (n ≤ 30)